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Preface to the first edition 


One of the most popular upper level mathematics courses taught 
at Caltech for very many years was H. J. Ryser’s course Combina- 
torial Analysis, Math 121. One of Ryser’s main goals was to show 
elegance and simplicity. Furthermore, in this course that he taught 
so well, he sought to demonstrate coherence of the subject of com- 
binatorics. We dedicate this book to the memory of Herb Ryser, 
our friend whom we admired and from whom we learned much. 


Work on the present book was started during the academic year 
1988-89 when the two authors taught the course Math 121 together. 
Our aim was not only to continue in the style of Ryser by showing 
many links between areas of combinatorics that seem unrelated, 
but also to try to more-or-less survey the subject. We had in mind 
that after a course like this, students who subsequently attend a 
conference on “Combinatorics” would hear no talks where they are 
completely lost because of unfamiliarity with the topic. Well, at 
least they should have heard many of the words before. We strongly 
believe that a student studying combinatorics should see as many 
of its branches as possible. 


Of course, none of the chapters could possibly give a complete 
treatment of the subject indicated in their titles. Instead, we cover 
some highlights—but we insist on doing something substantial or 
nontrivial with each topic. It is our opinion that a good way to 
learn combinatorics is to see subjects repeated at intervals. For 
this reason, several areas are covered in more than one part of the 
book. For example, partially ordered sets and codes appear several 
times. Enumeration problems and graph theory occur throughout 
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the book. A few topics are treated in more detail (because we like 
them) and some material, like our proof of the Van der Waerden 
permanent conjecture, appears here in a text book for the first 
time. 

A course in modern algebra is sufficient background for this book, 
but is not absolutely necessary; a great deal can be understood 
with only a certain level of maturity. Indeed, combinatorics is well 
known for being “accessible”. But readers should find this book 
challenging and will be expected to fill in details (that we hope 
are instructive and not too difficult). We mention in passing that 
we believe there is no substitute for a human teacher when trying 
to learn a subject. An acquaintance with calculus, groups, finite 
fields, elementary number theory, and especially linear algebra will 
be necessary for some topics. Both undergraduates and graduate 
students take the course at Caltech. The material in every chapter 
has been presented in class, but we have never managed to do all 
the chapters in one year. 

The notes at the end of chapters often include biographical re- 
marks on mathematicians. We have chosen to refrain from any 
mention of living mathematicians unless they have retired (with 
the exception of P. Erdés). 

Exercises vary in difficulty. For some it may be necessary to 
consult the hints in Appendix 1. We include a short discussion of 
formal power series in Appendix 2. 

This manuscript was typeset by the authors in A,\,4S-TRx. 


J.H.v.L., R.M.W. 
Eindhoven and Pasadena, 1992 


Preface to the 2nd edition 


The favorable reception of our book and its use for a variety of 
courses on combinatorial mathematics at numerous colleges and 
universities has encouraged us to prepare this second edition. We 
have added new material and have updated references for this ver- 
sion. A number of typographical and other errors have been cor- 
rected. We had to change “this century” to “the last century” in 
several places. 


The new material has, for the most part, been inserted into the 
chapters with the same titles as in the first edition. An exception 
is that the material of the later chapters on graph theory has been 
reorganized into four chapters rather than two. The added material 
includes, for example, discussion of the Lovasz sieve, associative 
block designs, and list colorings of graphs. 


Many new problems have been added, and we hope that this last 
change, in particular, will increase the value of the book as a text. 
We have decided not to attempt to indicate in the book the level 
of difficulty of the various problems, but remark again that this 
can vary greatly. The difficulty will often depend on the experience 
and background of the reader, and an instructor will need to decide 
which exercises are appropriate for his or her students. We like the 
idea of stating problems at the point in the text where they are 
most relevant, but have also added some problems at the end of 
the chapters. It is not true that the problems appearing later are 
necessarily more difficult than those at the beginning of a chapter. 
A number of the hints and comments in Appendix 1 have been 
improved. 
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Preparation of the second edition was done during a six-month 
visit to the California Institute of Technology by the first author 
as Moore Distinguished Scholar. He gratefully acknowledges the 
support of the Moore Foundation. 


1 
Graphs 


A graph G consists of a set V (or V(G)) of vertices, a set E (or 
E(G)) of edges, and a mapping associating to each edge e € E(G) 
an unordered pair x,y of vertices called the endpoints (or simply 
the ends) of e. We say an edge is incident with its ends, and that 
it joins its ends. We allow x = y, in which case the edge is called 
a loop. A vertex is isolated when it is incident with no edges. 

It is common to represent a graph by a drawing where we repre- 
sent each vertex by a point in the plane, and represent edges by line 
segments or arcs joining some of the pairs of points. One can think 
e.g. of a network of roads between cities. A graph is called planar 
if it can be drawn in the plane such that no two edges (that is, the 
line segments or arcs representing the edges) cross. The topic of 
planarity will be dealt with in Chapter 33; we wish to deal with 
graphs more purely combinatorially for the present. 


edge ends 
a Cie) 
b Y, W 
Cc ine) 
d Zz, Ww 
e€ a 
f x,y 
g Bw 
Figure 1.1 


Thus a graph is described by a table such as the one in Fig. 1.1 
that lists the ends of each edge. Here the graph we are describing 
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has vertex set V = {z,y,z,w} and edge set E = {a,b,c,d,e, f, g}; 
a drawing of this graph may be found as Fig. 1.2(iv). 

A graph is simple when it has no loops and no two distinct edges 
have exactly the same pair of ends. Two nonloops are parallel 
when they have the same ends; graphs that contain them are called 
multigraphs by some authors, or are said to have ‘multiple edges’. 

If an ordered pair of vertices is associated to each edge, we have 
a directed graph or digraph. In a drawing of a digraph, we use an 
arrowhead to point from the first vertex (the tail) towards the sec- 
ond vertex (the head) incident with an edge. For a simple digraph, 
we disallow loops and require that no two distinct edges have the 
same ordered pair of ends. 

When dealing with simple graphs, it is often convenient to iden- 
tify the edges with the unordered pairs of vertices they join; thus 
an edge joining x and y can be called {x,y}. Similarly, the edges 
of a simple digraph can be identified with ordered pairs (x,y) of 
distinct vertices. 


A AH 


i) graph (ii) graph with loop (iii) digraph (iv) multiple edges 


Figure 1.2 


There are several ways to draw the same graph. For example, 
the two graphs of Fig. 1.3 are essentially the same. 

We make this more precise, but to avoid unnecessarily technical 
definitions at this point, let us assume that all graphs are undirected 
and simple for the next two definitions. 

We say two graphs are isomorphic if there is a one-to-one cor- 
respondence between the vertex sets such that if two vertices are 
joined by an edge in one graph, then the corresponding vertices are 
joined by an edge in the other graph. To show that the two graphs 
in Fig. 1.3 are the same, find a suitable numbering of the vertices 
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in both graphs (using 1, 2,3,4,5,6) and observe that the edge sets 
are the same sets of unordered pairs. 


A OD 


Figure 1.3 


A permutation o of the vertex set of a graph G with the property 
that {a,b} is an edge if and only if {o(a),o(b)} is an edge, is called 
an automorphism of G. 


Problem 1A. (i) Show that the drawings in Fig. 1.4 represent the 
same graph (or isomorphic graphs). 

(ii) Find the group of automorphisms of the graph in Fig. 1.4. 
Remark: There is no quick or easy way to do this unless you are 
lucky; you will have to experiment and try things. 


wy & 


Figure 1.4 


The complete graph K, on n vertices is the simple graph that 
has all (5) possible edges. 

Two vertices a and 6 of a graph G are called adjacent if they are 
distinct and joined by an edge. We will use ['(x) to denote the set 
of all vertices adjacent to a given vertex x; these vertices are also 
called the neighbors of x. 
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The number of edges incident with a vertex «x is called the degree 
or the valency of x. Loops are considered to contribute 2 to the 
valency, as the pictures we draw suggest. If all the vertices of a 
graph have the same degree, then the graph is called regular. 

One of the important tools in combinatorics is the method of 
counting certain objects in two different ways. It is a well known 
fact that if one makes no mistakes, then the two answers are the 
same. We give a first elementary example. A graph is finite when 
both E(G) and V(G) are finite sets. We will be primarily con- 
cerned with finite graphs, so much so that it is possible we have 
occasionally forgotten to specify this condition as a hypothesis in 
some assertions. 


Theorem 1.1. A finite graph G has an even number of vertices 
with odd valency. 


PROOF: Consider a table listing the ends of the edges, as in Fig. 
1.1. The number of entries in the right column of the table is twice 
the number of edges. On the other hand, the degree of a vertex x 
is, by definition, the number of times it occurs in the table. So the 
number of entries in the right column is 


(1.1) S— deg(x) = 2|E(Q)|. 


xeEV(G) 


The assertion follows immediately. 


The equation (1.1) is simple but important. It might be called 
the ‘first theorem of graph theory’, and our Theorem 1.1 is its first 
corollary. 

A subgraph of a graph G is a graph H such that V(H) C V(G), 
E(H) C E(G), and the ends of an edge e € E(H) are the same 
as its ends in G. H is a spanning subgraph when V(H) = V(G). 
The subgraph of G induced by a subset S of vertices of G is the 
subgraph whose vertex set is S and whose edges are all the edges 
of G with both ends in S. 

A walk in a graph G consists of an alternating sequence 


ZO, €1, 01, €2,02,---,Uk-1,€k, Vk 
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of vertices x;, not necessarily distinct, and edges e; so that the ends 
of e; are exactly x;_; and x;,i1 = 1,2,...,k. Such a walk has length 
k. If the graph is simple, a walk is determined by its sequence of 
vertices, any two successive elements of which are adjacent. 


If the edge terms €),...,e, are distinct, then the walk is called a 
path from xo to xz. If xo = xz, then a walk (or path) is called closed. 
A simple path is one in which the vertex terms 2%, 71,...,U, are 


also distinct, although we say we have a simple closed path when 
k; > 1 and all vertex terms are distinct except xp = 2x. 

If a path from x to y exists for every pair of vertices x,y of G, 
then G is called connected. Otherwise G consists of a number of 
connected components (maximal connected subgraphs). It will be 
convenient to agree that the null graph with no vertices and no 
edges is not connected. 


Problem 1B. Suppose G is a simple graph on 10 vertices that is 
not connected. Prove that G has at most 36 edges. Can equality 
occur? 


The length of the shortest walk from a to b, if such walks exist, is 
called the distance d(a,b) between these vertices. Such a shortest 
walk is necessarily a simple path. 


Example 1.1. A well known graph has the mathematicians of the 
world as vertices. Two vertices are adjacent if and only if they 
have published a joint paper. The distance in this graph from 
some mathematician to the vertex P. Erdés is known as his or her 
Erd6s-number. 


e0AUO> 


Figure 1.5 


A polygon is the ‘graph of’ a simple closed path, but more pre- 
cisely it can be defined as a finite connected graph that is regular 
of degree 2. There is, up to isomorphism, exactly one polygon P,, 
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with n vertices (often called the n-gon) for each positive integer n. 
The sequence of polygons is shown in Fig. 1.5. 

A connected graph that contains no simple closed paths, i.e. that 
has no polygons as subgraphs, is called a tree. 


Problem 1C. Show that a connected graph on n vertices is a tree 
if and only if it has n — 1 edges. 


Problem 1D. The complete bipartite graph Ky m has n +m ver- 
tices a1,...,@, and b),...,bm, and as edges all mn pairs {aj, b;}. 
Show that 33 is not planar. 


No introduction to graph theory can omit the problem of the 
bridges of K6nigsberg (formerly a city in Prussia). The river Pregel 
flowed through this city and split into two parts. In the river was 
the island Kneiphof. There were seven bridges connecting different 
parts of the city as shown in the diagram of Fig. 1.6. 


Konigsberg 
Figure 1.6 


In a paper written in 1736 by L. Euler (considered the first paper 
on graph theory) the author claims that the following question was 
considered difficult: Is it possible to make a walk through the city, 
returning to the starting point and crossing each bridge exactly 
once? This paper has led to the following definition. A closed path 
through a graph using every edge once is called an Eulerian circuit 
and a graph that has such a path is called an Eulerian graph. 


Theorem 1.2. A finite graph G with no isolated vertices (but pos- 
sibly with multiple edges) is Eulerian if and only if it is connected 
and every vertex has even degree. 
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PRooF: That G must be connected is obvious. Since the path 
enters a vertex through some edge and leaves by another edge, it 
is clear that all degrees must be even. To show that the conditions 
are sufficient, we start in a vertex x and begin making a path. We 
keep going, never using the same edge twice, until we cannot go 
further. Since every vertex has even degree, this can only happen 
when we return to x and all edges from x have been used. If 
there are unused edges, then we consider the subgraph formed by 
these edges. We use the same procedure on a component of this 
subgraph, producing a second closed path. If we start this second 
path in a point occurring in the first path, then the two paths can 
be combined to a longer closed path from x to x. Therefore the 
longest of these paths uses all the edges. 


The problem of the bridges of K6nigsberg is described by the 
graph in Fig. 1.6. No vertex has even degree, so there is no Eulerian 
circuit. 

One can consider a similar problem for digraphs. The necessary 
and sufficient condition for a directed Eulerian circuit is that the 
graph is connected and that each vertex has the same ‘in-degree’ 
as ‘out-degree’. 


Example 1.2. A puzzle with the name Instant Insanity concerns 
four cubes with faces colored red, blue, green, and yellow, in such a 
way that each cube has at least one face of each color. The problem 
is to make a stack of these cubes so that all four colors appear on 
each of the four sides of the stack. In Fig. 1.7 we describe four 
possible cubes in flattened form. 


R R G B 
R/Y|G|B R/Y|B/G BIB|R/Y G|Y|RI|G 
R Y G Y 
cube 1 cube 2 cube 3 cube 4 
Figure 1.7 


It is not a very good idea to try all possibilities. A systematic 
approach is as follows. The essential information about the cubes 
is given by the four graphs in Fig. 1.8. 
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R BR 2B 
cube 1 cube 2 cube 3 cube 4 the eo G 
Figure 1.8 


An edge indicates that the two adjacent colors occur on opposite 
faces of the cube. We obtain a graph G by superposition of the 
four graphs and number the edges according to their origin. It is 
not difficult to see that we need to find in G two subgraphs that 
are regular of degree 2, with edges numbered 1, 2,3, 4 and such that 
they have no edge in common. One of the subgraphs tells us which 
pairs of colors to align on the left side and right side of the stack. 
The other graph describes the colors on front and back. Of course 
it is easy to rotate the cubes in such a way that the colors are where 
we wish them to be. The point of the example is that it takes only 
a minute to find two subgraphs as described above. In this example 
the solution is unique. 


We mention a concept that seems similar to Eulerian circuits but 
that is in reality quite different. A Hamiltonian circuit in a graph 
G is a simple closed path that passes through each vertex exactly 
once (rather than each edge). So a graph admits a Hamiltonian 
circuit if and only if it has a polygon as a spanning subgraph. In 
the mid-19th century, Sir William Rowan Hamilton tried to popu- 
larize the exercise of finding such a closed path in the graph of the 
dodecahedron (Fig. 1.9). 


i 


Cag 


Figure 1.9 
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The graph in Fig. 1.4 is called the Petersen graph (cf. Chapter 21) 
and one of the reasons it is famous is that it is not ‘Hamiltonian’; 
it contains n-gons only for n = 5,6,8,9, and not when n = 7 or 
n= 10. 

By Theorem 1.2, it is easy to decide whether a graph admits an 
Eulerian circuit. A computer can easily be programmed to check 
whether the degrees of a graph are even and whether the graph is 
connected, and even to produce an Eulerian circuit when one exists. 
In contrast to this, the problem of deciding whether an arbitrary 
graph admits a Hamiltonian circuit is likely ‘intractable’. To be 
more precise, it has been proved to be NP-complete—see Garey 
and Johnson (1979). 


Problem 1E. Let Aj,...,A, be n distinct subsets of the n-set 
N := {1,...,n}. Show that there is an element x € N such that 
the sets A;\{x}, 1 <i <n, are all distinct. To do this, form a graph 
G on the vertices A; with an edge with ‘color’ x between A; and A; 
if and only if the symmetric difference of the sets A; and A; is {x}. 
Consider the colors occurring on the edges of a polygon. Show that 
one can delete edges from G in such a way that no polygons are 
left and the number of different colors remains the same. Then use 
1C. (This idea is due to J. A. Bondy (1972).) 


Problem 1F. The girth of a graph is the length of the smallest 
polygon in the graph. Let G be a graph with girth 5 for which all 
vertices have degree > d. Show that G has at least d? + 1 vertices. 
Can equality hold? 

Problem 1G. Show that a finite simple graph with more than one 
vertex has at least two vertices with the same degree. 


Problem 1H. A graph on the vertex set {1,2,...,} is often de- 
scribed by a matrix A of size n, where aj; and aj; are equal to 
the number of edges with ends 7 and 7. What is the combinatorial 
interpretation of the entries of the matrix A?? 


Problem 1I. Let Q := {1,2,...,q}. Let G be a graph with the 
elements of Q” as vertices and an edge between (a1, @2,...,@,) and 
(b;, bo,...,6,) if and only if a; 4 b; for exactly one value of 7. Show 
that G is Hamiltonian. 


10 A Course in Combinatorics 


Problem 1J. Let G be a simple graph on n vertices (n > 3) with 
no vertex of degree n — 1. Suppose that for any two vertices of G, 
there is a unique vertex joined to both of them. 

(i) If x and y are not adjacent, prove that they have the same 
degree. 

(ii) Now show that G is a regular graph. 


Notes. 

Paul Erdés (1913-1996) (cf. Example 1.1) was probably the most 
prolific mathematician of the 20th century with well over 1400 pa- 
pers having been published. His contributions to combinatorics, 
number theory, set theory, etc., include many important results. 
He collaborated with many mathematicians all over the world, all 
of them proud to have Erd6és-number 1, among them the authors 
of this book; see J. W. Grossman (1997). 

Leonhard Euler (1707-1783) was a Swiss mathematician who 
spent most of his life in St. Petersburg. He was probably the most 
productive mathematician of all times. Even after becoming blind 
in 1766, his work continued at the same pace. The celebration in 
1986 of the 250th birthday of graph theory was based on Euler’s 
paper on the Konigsberg bridge problem. Konigsberg is now the 
city of Kaliningrad in Russia. 

For an elementary introduction to graph theory, we recommend 
R. J. Wilson (1979), and J. J. Watkins and R. J. Wilson (1990). 

Sir William Rowan Hamilton (1805-1865) was an Irish mathe- 
matician. He was considered a genius. He knew 13 languages at 
the age of 12 and was appointed professor of astronomy at Trinity 
College Dublin at the age of 22 (before completing his degree). His 
most important work was in mathematical physics. 


References. 

M. Garey and D. S. Johnson (1979), Computers and Intractability; 
A Guide to the Theory of NP-completeness, W. H. Freeman and 
Co. 

J. W. Grossman (1997), Paul Erdés: The Master of Collaboration, 
pp. 467-475 in The Mathematics of Paul Erdés, R. L. Graham 
and J. NeSetril (eds.), Springer-Verlag. 
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2 
Trees 


We come to the first not so easy theorem. It is due to A. Cayley 
(1889). We shall give three different proofs here. Two more proofs 
will occur in later chapters; see Example 14.14 and Example 38.2. 
The first two proofs illustrate a method that is used very often in 
combinatorics. In order to count certain objects that seem hard to 
count, one finds a one-to-one mapping onto a set of other objects 
whose number is easier to determine. 


Theorem 2.1. There aren”? different labeled trees on n vertices. 


The term labeled emphasizes that we are not identifying isomor- 
phic graphs. We have fixed the set of vertices, and two trees are 
counted as the same if and only if exactly the same pairs of vertices 
are adjacent. A spanning tree of a connected graph G is a spanning 
subgraph of G that is a tree. The theorem could have been stated: 
the complete graph K,, has n”~? spanning trees. 


Example 2.1. Here are the 16 labeled trees on four vertices: 


Fi IN 
WGA 
Ae 
Ww2a6e 
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Example 2.2. There are three nonisomorphic trees on five ver- 
tices: 


The number of spanning trees in Ks isomorphic to a specific tree 
T on five vertices is 5! divided by the order of the automorphism 
group of T (why?). Thus there are 5!/4! = 5 trees in Ks isomorphic 
to the first tree above, and 5!/2 = 60 trees isomorphic to either of 
the other two trees, for a total of 125 spanning trees. 


Problem 2A. Find the six nonisomorphic trees on 6 vertices, and 
for each compute the number of distinct spanning trees in K¢ iso- 
morphic to it. 


Before starting the proofs, we make the following observations. 
(Probably the reader has already noticed these things in solving 
Problem 1C.) Firstly, every tree with n > 2 vertices has at least 
two monovalent vertices (vertices of degree 1). This is immediate, 
for example, from Problem 1C and equation (1.1): the sum of the 
degrees dj, dz,...,dn, all of which are at least 1, is 2n—2. Secondly, 
if a monovalent vertex and its incident edge are deleted from a tree, 
the resulting graph is still a tree. Finally, given a tree 7’, if we 
introduce a new vertex x and a new edge joining x to any vertex 
of T’, the new graph is again a tree. 


Figure 2.1 


ProoF 1: The first proof we present, due to H. Priifer (1918), uses 
an algorithm that associates to any tree T a ‘name’ P(T) (called 
the Priifer code) that characterizes the tree. 
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For the vertices of K,,, we take the ordered set V = {1, 2,3,...,n}. 
Given a spanning tree TJ’ in K,, we let T; = T and generate a se- 
quence of trees 7),75,...,7,-1; and two sequences of vertices as 
follows: Given the tree T; with n—7+1 vertices, 7 = 1,2,...,n—1, 
let x; be the least monovalent vertex of T; and delete x; and its 
incident edge {x;, y;} from T; to obtain a tree T;,; on n—7 vertices. 
The name of T is to be 


P(T) = (1, Y2,++- sUnsia)s 


We claim that the mapping ?, from the set of all spanning trees 
in K,, to the set V"~? of all possible names, is one-to-one and onto 
(bijective). This will prove that the number of spanning trees in 
K, is n™?. 

For the tree in Fig. 2.1, where n = 10, we have (21,41) = (8,2), 
(x2, y2) = (4, 2), (73, 3) = (2, die ty (79, Yo) — (9, 10); these edges 
are the columns of the matrix below. 


3 42 5 67 1 8 Q 
22 1 1 7 1 «#10 10 #10 


So P(T) = (2,2,1,1,7,1,10,10). Don’t include yo = 10. 

To understand why P is bijective, we first note some simple facts 
about the x;’s and y;’s. First, y,_1 = n, always. This is because 
every tree (with at least two vertices) has at least two monovalent 
vertices, so the vertex n will never be the least monovalent vertex. 
Second, 4, %41,---,2n-1 and n are the vertices of the tree Ty. 
Third, {x,y}, k <i <n-—1, are exactly the edges of T),, in some 
order. 

The number of times a vertex v occurs among 41, Y2,---, Yn—2 iS 
deg;(v)—1. This is because v occurs deg (v) times among the edges 
{x;,yi$}, 1 <i <mn-—1, and exactly once in 21, 2%9,...,2%n—1, Yn-1- 
Similarly, the number of times a vertex v of T; occurs among 
Yks Ykt1s-+++>Yn—2 1S its degree in the tree Tj, less 1. In particular, 
the monovalent vertices of Ty, are those elements of V not in 


eae LQ,++- opi U {Yk Yk+ly+ +e Gait, 


and this means that x;, the least monovalent vertex of Ti, is the 
least element of {1,2,...,n} not in the above set. In particular, 
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x1 is the least element of V not in the name P(T), and we can 
uniquely determine x, from P(T’) and x,..., 24-1. 


Problem 2B. How many trees TJ are there on the set of vertices 
{1, 2,3, 4,5, 6,7} in which the vertices 2 and 3 have degree 3, vertex 
5 has degree 2, and hence all others have degree 1? Do not just 
draw pictures but consider the possible Priifer codes of these trees. 


PROOF 2: We give another proof, again by a reversible algorithm. 
Consider any mapping f from {2,3,...,n — 1} to {1,2,...,n}. 
There are n”~? such mappings f. Construct a digraph D on the 
vertices 1 to n by defining (7, f(z)), i = 2,...,n—1, to be the edges. 
Fig. 2.2 shows an example with n = 21. 

D consists of two trees ‘rooted’ at 1 and n and a number (say k) of 
circuits (directed polygons) to which trees are attached. (Directed 
trees with all the edges pointing in the direction of one vertex, called 
the root, are called arborescences.) These circuits are placed as in 
Fig. 2.2 where the rightmost vertex in the 7-th component, denoted 
by r;, is its minimal element (and 1; is the vertex on the left). The 
circuits are ordered by the condition ry < rg <--- < ry. To D we 
adjoin the tree obtained by adding the edges {1,1)}, {ri, lo}, ..., 
{re—1, le}, {rx,n} and deleting the edges {r;,1;} as in Fig. 2.3. 


21 
4 3 7 0 12 : 
16 19 
9 10 11 2 8 
15 18 
17 13 14 


Figure 2.2 


If the tree of Fig. 2.3 is given, consider the path from 1 to n 
(=21). Let ro := 1. Define r; to be the minimal number on this 
path (excluding rp = 1) and in general r; as the minimal number 
on the path from r;_; to n. It is easily seen that we recover the 
function f in this way. 
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7 20 I 


| A 5 3 ” 
> —_ -—@ > . > > t 
16 /» \ 19 
“ ; Wg 8 
7! 


0 13 14 1h 


Figure 2.3 


Generalizations of this proof may be found in Egecioglu and Rem- 
mel (1986). 


Problem 2C. Let G be a directed graph with vertices 71,..., 2, 
for which a (directed) Eulerian circuit exists. A spanning arbores- 
cence with root x; is a spanning tree JT of G, with root x;, such 
that for all 7 A 7 there is a directed path from z; to x; in T. Show 
that the number of spanning arborescences of G with root x; does 
not depend on i. (This is difficult; see the hints.) 


PROOF 3: We now give a proof by counting since it is useful to 
have seen this method. We remind the reader of the definition of 
a multinomial coefficient. Let r1,1r2,...,7, be nonnegative integers 
with sum n. Then (as) is defined by 


(2.1) (x1 +2 +---+ rR)” = S- & 7 


where the sum is over all k-tuples (r,,...,7;) with sum n. 
Since (41 +++ +24)" = (a, +--+ + 2%)" "(21 +--+ + 2%), we have 


k 
n n—-1 
2.2 = : 
( ) ae Ss eee 


i=1 


We denote the number of labeled trees with n vertices for which the 
degrees are d1, d9,..., dy, by t(n; dy, dz,...,d,). Clearly this number 
is 0 if one of the d; is 0. The value of t(n; di, d2,...,d,) depends 
only on the multiset of numbers d; and not on their order. We 
may assume without loss of generality that dj > dy > --- > dn, 
so d, = 1. Take the vertex v, corresponding to d,. It is joined to 
some vertex vu; of degree d; > 2, and any of the remaining vertices 
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is a candidate. Therefore 
(2:3) Ls diese, ) = t(n —1;dy,...,d; —1,...,dy_1). 


It is trivial to check by hand that 


n—2 
2.4 ; ner = 
( ) t(n; dy, jaa) ae a 


for n = 3. Since the numbers on the left-hand side, respectively 
right-hand side, of (2.4) satisfy the same recurrence relation ( (2.3), 
respectively (2.2) ) it follows by induction that (2.4) is true for all n. 
In (2.1), we replace n by n — 2, k by n, r; by d; — 1 and a; by 1. 
We find 

We = So t(n; dy, dg, sey iy.) 


Compare (2.4) with Problem 2B. 

A spanning tree is easily constructed by starting at any vertex, 
taking the edges to vertices at distance 1, then one edge to each 
vertex at distance 2, etc. Several other constructions are possible 
(e.g. by starting with G and deleting suitable edges). 

A graph with no polygons as subgraphs is called a forest. Each 
component C1, C,...,C, of a forest G is a tree, so if a forest with 
n vertices has k components, it has 


(1V(C1)| — 1) + (IV (C2), — 1) +--+ (IV (Ce) - lI) =n —k 


edges. 

A weighted graph is a graph G together with a function associat- 
ing a real number c(e) (usually nonnegative) to each edge e, called 
its length or cost according to context. Let us use the term ‘cost’ 
here. Given a weighted connected graph G, define the cost of a 
spanning tree T’ of G as 
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The graph may represent a network of cities where c({z, y}) is the 
cost of erecting a telephone line joining cities x and y, and so it 
is clear that finding a cheapest spanning tree in G is a problem of 
practical importance. 

The following method is often called the greedy algorithm. In fact, 
it is only one of a number of algorithms which can be called greedy 
algorithms, where one does not plan ahead but takes what seems 
to be the best alternative at each moment and does not look back. 
It is surprising that such a simple procedure actually produces a 
cheapest spanning tree, but this is proved in Theorem 2.2 below. 
Let us say that a set S of edges of a graph G is independent when 
the spanning subgraph with edge set S (denoted G:)) is a forest. 


Greedy algorithm. Let G be a connected weighted graph with 
n vertices. At each point, we will have a set {e1,¢2,...,e;} of i 
independent edges (i = 0 to start), so that G:{e1, e2,..., e;} has n—7 
components. If 7 << n—1, let e;,; be an edge with ends in different 
components of G:{e1, €2,...,e;} and whose cost is minimum with 
respect to this property. Stop when we have chosen n — 1 edges. 


Theorem 2.2. With e1,...,€n—; chosen as above, the spanning 
tree Tp := G:{e1,...,@€n-1} has the property that c(To) < c(T) for 
any spanning tree T’. 


PROOF: Let {aj, d2,...,@n—1} be the edge set of a tree T, numbered 
so that c(a,) < c(az) < +--+ < c(a@p_1). We claim something much 
stronger than c(Zp) < c(7’); namely, we claim that c(e;) < c(a;) for 
each 1 = 1,2...,n—1. If this is false, then 


c(ex) > c(ax) > C(ag_-1) > +--+ > c(ar) 


for some k. Since none of a1, a2,...,@, was chosen at the point 
when e; was chosen, each of these k edges has both ends in the 
same component of G:{e1, €2,...,¢,-1}. Then the number of com- 
ponents of G:{a1,a9,...,a,} is at least the number n — k + 1 of 
components of G':{e1, e2,..., ex-1} and this contradicts the fact that 
{a1, @2,...,@,} is independent. 


Problem 2D. Here is a variation on the above greedy algorithm. 
Let x; be any vertex of a weighted connected graph G with n 
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vertices and let 7, be the subgraph with the one vertex x; and 
no edges. After a tree (subgraph) T;, k < n, has been defined, 
let e, be a cheapest edge among all edges with one end in V(T;) 
and the other end not in V(7;,), and let 7,4; be the tree obtained 
by adding that edge and its other end to 7;. Prove that T;, is a 
cheapest spanning tree in G. 


In many practical situations, it is necessary to search through a 
tree starting from a particular vertex. (A tree with a distinguished 
vertex—the root—is called a rooted tree.) There are two well known 
methods known as depth-first search and breadth-first search. We 
explain the terminology by the example of Fig. 2.4. 


Figure 2.4 


In a depth-first search starting at a, one essentially considers the 
tree as a fence and walks around it, keeping the fence on the left, 
i.e. along the walk abdidjdbebfk...lhca. If one decides to number 
the vertices in accordance with the search, one finds the numbering 
a=1,b=2,d=3,1=4,... ,1= 12. In this description, we rely 
on a planar drawing of the tree; but see below. 

In a breadth-first search, one proceeds as in the construction of 
a spanning tree mentioned above. The vertices are then numbered 
in the order of Fig. 2.4, i.e. alphabetically. 

These ideas apply, more generally, to searching through the ver- 
tices of a connected graph. 

Given a finite connected graph G, we can obtain a numbering 
of the vertices of G and a spanning tree T of G, called a ‘depth- 
first search tree’ for G, in the following manner. Pick a vertex vp 
and start with the tree 7) with vertex vg and no edges. Proceed 
inductively: once vertices vp, V1, V2,..-., Vz and a tree Tj, with exactly 
those vertices and some of the edges of G have been chosen, let 
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be the largest index < k so that vg is adjacent to some vertex not 
in T;. Such a vertex will exist if 7), is not yet a spanning tree; 
otherwise stop and let T’ = T;. Call that new vertex vz, and add 
it and the edge {ve, vr4i1} to T), to obtain a tree T;,41. We consider 
T as a rooted tree with root vo. 

We give two properties of depth-first search trees and use them 
to give a simple constructive proof of a theorem on orientations of 
graphs. 

Given a vertex x of a rooted tree with root vp, the ancestors of x 
are the vertices traversed by the (unique) path from x to the root 
up. The first vertex other than x on that path is the parent of x. If 
x is an ancestor of y, we also say that y is a descendant of x. We 
will count x as a descendent and ancestor of itself. 


Proposition 2.3. If vertices x and y are adjacent in G, then one 
of them is a descendant of the other in any depth-first search tree 
T of G. 


PROOF: Suppose x is numbered with a smaller index than y in the 
depth-first search, say x = vz. 

At the stage when vp, v1, V2,...,U% have been chosen, the largest 
index £ so that ve is adjacent to an as-yet-unnumbered vertex is 
clearly 0 = k. So vg41 is joined to vz (and not to some v; with i < k) 
in T. If vp41 = y, we are done; y is a descendant of vz. Otherwise, 
since vu; is still adjacent to an as-yet-unnumbered vertex, namely 
y, the choice of @ will be €= k or 2=k+1, and vpy2 is adjacent to 
either vy, or vgs, in T. If vpso = y, we are done; y is a descendant 
of Uk. 

Inductively, as long as y remains unnumbered, vp, VE+1, «++ 5 Uk+j 
will be descendants of v; and the next choice of @ will be as one of 
k,k+1,...,k+ J. Then the vertex vz4;4; numbered next will be 
adjacent to one of vz, Ug41,---,Uk+j; and hence will be a descendant 
of vz. Since the graph is finite, eventually this newly numbered 
vertex must be y. 


An isthmus of a connected graph G is an edge whose deletion 
results in a disconnected graph. (Some authors call this a bridge.) 


Proposition 2.4. Let {x,y} be an edge of T which is not an 
isthmus in G; say x is the parent of y. Then there is an edge in G 
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but not in T’ joining some descendant a of y and some ancestor b 
of z. 


Proor: Let D be the set of descendants of y. Soy € Dand a ¢ D. 
Since G with {x,y} deleted is still connected, there is some edge 
{a,b} # {x,y} with one end a € D and the other end b ¢ D. This 
edge {a,b} is certainly not in T and by Proposition 2.3, b is an 
ancestor of a (since it cannot be a descendant of a because then it 
would be a descendant of y too and hence would be in D). The 
(unique) path in T from a to vp passes through y (an ancestor of 
a) and then xz (the parent of y); but 6 must be on this path (since 
it is an ancestor of a), so b must be an ancestor of x too. 


Any directed graph obtained from an undirected graph G by 
assigning a direction to each edge of G is called an orientation of 
G. A walk in a digraph D may be called strong when each edge is 
traversed by the walk according to its direction, i.e. from its tail to 
its head, and the digraph D is strongly connected when for any two 
vertices x,y, there is a strong walk from x to y. 


Theorem 2.5. Let G be a finite connected graph without isth- 
muses. Then G admits a strong orientation, i.e. an orientation 
that is a strongly connected digraph. 


PROOF: We will construct a digraph D from G by choosing a direc- 
tion for each edge. Find a depth-first search tree JT’ and numbering 
Uo, U1,-.. Of the vertices of G. Let {v;,v;} be an edge of G with 
i<j. If {u,v;} is in T, direct it from v; to v;, ie. (w%,v;) is an 
edge of D. If {v;,v;} is in not T, direct it from v; to vj, ie. (vj, vi) 
is an edge of D. 

It remains to show that D is strongly connected. There is a 
strong walk from vp to any vertex x of G (using only edges of the 
tree T’), so it will suffice to show that we can find a strong walk 
from any vertex x to Uo. 

Given a vertex x;,, k > 0, Proposition 2.4 says that some edge 
{a,b} in G but not in T joins some descendant a of y to some 
ancestor b = vu; of vz. We get a strong walk in D from vz to v; by 
appending the directed edge (a,v;) to a strong walk in T from v; 
to its descendant a. Of course, 7 < k since v; is an ancestor of vp. 
If 7 = 0 we are done. Otherwise, we repeat the argument to find 
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a strong walk from v; to some v; with 7 < 7 and concatenate the 
walks to get a strong walk from v; to v;. Continue in this manner 
until you reach vp. 


Problem 2E. A graceful labeling of a tree T on n vertices is a 
mapping f : V(T) — {1,2,...,n} so that the numbers |f (x) — f(y) 
computed across edges {x, y} are all different. Show that the path- 
graphs (trees with exactly two monovalent vertices) admit graceful 
labelings. (It is conjectured that all trees admit graceful labelings.) 


Problem 2F. Suppose a tree G has exactly one vertex of degree 
i for 2 <7 < m and all other vertices have degree 1. How many 
vertices does G have? 


Problem 2G. Let G be a graph with exactly one vertex of degree 
i for 2 <i <m and k other vertices, all of degree 1. Prove that 
k > |“). Give a construction for such a graph. 


Problem 2H. Consider labeled trivalent rooted trees T with 2n 
vertices, counting the root labeled 2n; see Figure 14.3. The labels 
are chosen in such a way that the procedure leading to P(T) has 
1,2,3,...,2n —1 as first row. How many possible codes P(T) are 
there? 


Notes. 

A. Cayley (1821-1895), professor at Cambridge from 1863 until 
his death, was one of the great mathematicians of the 19th century. 
His work includes important contributions to the theory of elliptic 
functions, analytic geometry and algebra, e.g. the theory of invari- 
ants. His paper on trees appeared in 1889 but it did not contain 
what we would consider a proof. Of the many proofs (five of which 
are treated in this book) the one by Priifer is the best known. 

H. Priifer (1896-1934) was one of I. Schur’s many pupils. He was 
professor at Miinster. 
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A. Cayley (1889), A theorem on trees, Quart. J. Pure and App. 
Math. 23, 376-378. 
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Colorings of graphs and 
Ramsey’s theorem 


We shall first look at a few so-called coloring problems for graphs. 

A proper coloring of a graph G is a function from the vertices to 
a set C of ‘colors’ (e.g. C = {1,2,3,4}) such that the ends of every 
edge have distinct colors. (So a graph with a loop will admit no 
proper colorings.) If |C| = k, we say that G is k-colored. 

The chromatic number x(G) of a graph G is the minimal number 
of colors for which a proper coloring exists. 

If x(G) = 2 (or x(G) = 1, which is the case when and only when 
G has no edges), then G is called bipartite. A graph with no odd 
polygons (equivalently, no closed paths of odd length) is bipartite 
as the reader should verify. 

The famous ‘Four Color Theorem’ (K. Appel and W. Haken, 
1977) states that if G is planar, then y(G) < 4. 

Clearly x(K,) =n. If k is odd then y(P;,) = 3. In the following 
theorem, we show that, with the exception of these examples, the 
chromatic number is at most equal to the maximum degree (R. L. 
Brooks, 1941). 


Theorem 3.1. Let d > 3 and let G be a graph in which all vertices 
have degree < d and such that Kq,, is not a subgraph of G. Then 
x(G) < d. 


PROOF 1: As is the case in many theorems in combinatorial analy- 
sis, one can prove the theorem by assuming that it is not true, then 
considering a minimal counterexample (in this case a graph with 
the minimal number of vertices) and arriving at a contradiction. 
We shall use the technique of recoloring: it is possible to change 
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the colors of certain vertices to go from one proper coloring to an- 
other. For example, let S be a subset of the set C of colors. On any 
connected component of the subgraph induced by the vertices with 
colors from S, we arbitrarily permute the colors (without changing 
those of the vertices with colors in C\S'). Clearly we again have a 
proper coloring. 


So let G be a counterexample with the minimum number of ver- 
tices. Let x € G and let I(x) = {x,...,a;}, 1 < d. Since G is 
a minimal counterexample, the graph H, obtained by deleting x 
and the edges incident with x, has a d-coloring, say with colors 
1,2,...,d. If one of these colors is not used in the coloring of I'(x), 
then we can assign this color to x and obtain a d-coloring of G. It 
follows that | = d and every d-coloring of H must use all the colors 
on the set I'(a). Let us assume that x; has color i for i = 1,2,...,d. 


Now consider x; and x; and the induced subgraph H;; of H with 
colors 7 and j. If x; and x; were in different connected components 
of H;;, then we could interchange the colors in one of these com- 
ponents, after which x; and x; would have the same color, which 
is impossible. So 2; and 2; are in the same component (say C;;) 
of H;;. We shall now show that this component is (the graph of) 
a simple path (with alternating colors 7 and j) from 2; to a;. If 
two neighbors of x; in H had color j, then the neighbors of x; in H 
would have at most d — 2 different colors. Then we could recolor 
x; and that is impossible. Suppose y is the first vertex on a path 
from x; to x; in Ci; that has degree > 3. The neighbors of y in H 
have at most d—2 colors, so we can recolor y to some color ¢ {i,j} 
and then x; and x; are no longer connected in H;;, which we know 
to be impossible. So such a y does not exist, proving that Cj; is a 
path. 


Suppose that z is a vertex 4 x; on C;; and on Cx. Then z has two 
neighbors with color 7 and two with color k. Again the neighbors 
of z in H have at most d—2 colors and z can be recolored to some 
color ¢ {i,j,k}, again a contradiction. Hence Ci; Ci, = {xi}. 

Our assumption that Ka,; ¢ G shows that there are two vertices 
in T(x), say 2; and x, that are not connected by an edge. We have 
the situation of Fig. 3.1. The vertex a is the neighbor of x; with 
color 2 on Cp. 
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We recolor H by interchanging the colors 1 and 3 on the subgraph 
C3. For the new coloring, we have new paths that we call Ci; 
Clearly a € C4, (since x; now has color 3). However, on C2 no point 
except x; has changed color, so a € Cj. Hence Ci, Cos 4 {x2}, 
contradicting what we proved above. The contradiction shows that 
our assumption that a minimal counterexample exists is false. 


Figure 3.1 


In preparation for a second proof of Brook’s Theorem, the reader 
should do the following problem without applying the theorem. 


Problem 3A. Fix an integer d > 3. Let H be asimple graph with 
all degrees < d which cannot be d-colored and which is minimal 
(with the fewest vertices) subject to these properties. (We claim H 
is complete on d+1 vertices, but we don’t know that yet.) (i) Show 
that H is nonseparable (this means that every graph obtained from 
H by deleting a vertex is connected). (ii) Then show that if the 
vertex set V(H) is partitioned into sets X and Y with |Y| > 3, 
then there are at least three vertices a,b,c € Y each of which is 
adjacent to at least one vertex in X. 


PROOF 2: Let d and H be as in Problem 3A. If H is not complete, 
there are vertices 41, %p_1,%, so that x1 is adjacent to 7,_; and x, 
but so that these last two vertices are not adjacent. We want to 
number the other n — 3 vertices so that the sequence 


U1, %2,+++5En-1, Un 
has the property that each x;,, k > 2, is adjacent to at least one of 


the vertices x; preceding it, i.e. with 7 < k. This is simple: when 
X1,22,...,2% have been chosen, k < n — 2, choose x44, to be any 
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vertex other than x,_1 or £, adjacent to one of 71, 72,..., @%; since 
there are at least three such vertices, this is always possible. 

Once this is done, we d-color the vertices starting at the end of 
the sequence. Assign z,_; and x, the same color. When the last 
k vertices ©444,...,2%n—1,%n have been colored, k > 2, there is a 
color available for x; since x, is adjacent to at most d— 1 of the 
already-colored vertices. Finally, there is a color available for x1 
since two of the vertices adjacent to x; have been given the same 
color. 


We now consider a coloring problem of a completely different 
nature. It serves as an introduction to a very important theorem of 
combinatorics, namely Ramsey’s theorem. Before reading on, the 
reader should try the following problem. 


Problem 3B. Let the edges of K7 be colored with the colors red 
and blue. Show that there are at least four subgraphs K3 with all 
three edges the same color (monochromatic triangles). Also show 
that equality can occur. 


The example that is always used to introduce this subject is K¢ 
with its edges colored red or blue. We shall show that there is at 
least one monochromatic triangle. A proof is as follows. Let a be 
any vertex. Because a has degree 5, it is incident with at least three 
edges of the same color, say red edges to the vertices b, c, d. If one 
of the edges between these three vertices is red, then we have a red 
triangle; if not, then they form a blue triangle. 

The idea of the proof is the same as for the more difficult situation 
of Ramsey’s theorem. However, it does not show as much as the 
following counting argument. 


Theorem 3.2. If the edges of K,, are colored red or blue, and r;, 
i= 1,2,...,n, denotes the number of red edges with vertex i as an 
endpoint, and if A denotes the number of monochromatic triangles, 
then 


(3.1) A= @ - 3 ne 1 7) 


PROOF: Every triangle in K, that is not monochromatic has ex- 
actly two vertices where a red and a blue edge meet. On the 7-th 
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vertex, two such edges can be chosen in r;(n — 1 — r;) ways. So the 
sum in (3.1) counts the bichromatic triangles twice. 


Corollary. 


(3.2) a> (3) - 1g 


PROOF: From (3.1) we see that A is minimized if r; = (n —1-—1;) 


for all 2 when n is odd, or if r; = 5 or 7; = 5 — 1 for all 7 in the 


case that n is even. Since A is an integer, the first situation cannot 
always arise. It is easy to show that (3.2) cannot be improved. 


Note that this argument shows that a red-blue coloring of K¢ 
must always have at least two monochromatic triangles. 
We now treat Ramsey’s theorem (Ramsey, 1930). 


Theorem 3.3. Let r > 1 and q > r, i = 1,2,...,s be given. 
There exists a minimal positive integer N(q), q2,.--,4s;1) with the 
following property. Let S be a set with n elements. Suppose that 
all (") r-subsets of S are divided into s mutually exclusive families 
T,,...,25 (‘colors’). Then ifn > N(q1,q2,---,4s;7) there is an i, 
1 <i<_s, and some q;-subset of S for which every r-subset is in 
eas 


(The reader should compare this with our introductory example 
and show that N(3, 3;2) = 6.) 


PROOF: We give the proof only for s = 2. The general case only 
involves a little more bookkeeping. 

(a) Trivially, the theorem is true for r = 1 and N(p,q;1) = 
p+q—l. 

(b) For any r and p > r it is also obvious that N(p,r;r) = p and 
similarly N(r,q;r) = q for q>r. 

(c) We proceed by induction on r. So assume the theorem is true 
for r— 1. We now use induction on p+ q, using (b). So we can 
define p} = N(p—1,q;r), a = N(p,q—1;r). Let S be a set with 
n elements, where n > 1+ N(p1,q1;r — 1). Let the r-subsets of S 
be colored with two colors, say red and blue. As in the proof for 
Kg, we pick an arbitrary element a of S. We now define a coloring 
of the (r — 1)-subsets of S’ := S\{a} by giving X C S’ the same 
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color as X U {a}. By induction S’ either contains a subset A of 
size p; such that all its (r — 1)-subsets are red or a subset B of size 
qi such that all its (r — 1)-subsets are colored blue. Without loss 
of generality the first situation occurs. Since A has N(p — 1,q;r) 
elements, there are two possibilities. The first is that A has a subset 
of g elements with all its r-subsets blue, in which case we are done. 
The other possibility is that A has a subset A’ of p — 1 elements 
with all its r-subsets red. The set A’ U {a} also has this property 
because A’ C A. This proves the theorem and furthermore we have 
shown 


(3:3), Nipgr)< NN p= 1,47); NO ¢—br r= 1) 41. 


A special case of (3.3) occurs when we go back to the coloring of 
edges (r = 2) of a graph with two colors. Using (a) from the proof, 
we find 
(3.4) N(p,q52) < N(p— 1, 9;2) + N(p,q — 1;2). 

Problem 3C. Show that equality cannot hold in (3.4) if both 


terms on the right-hand side are even. 


Theorem 3.4. 


PROOF: Since N(p, 2;2) = p, the result follows from (3.4) because 
binomial coefficients satisfy the same relation with equality. 


Let us look at what we now know about N(p,q;2). By Problem 
3C, we have N(3,4;2) < 9. To show that equality holds, we have 
to color Kg such that there is no red triangle and no blue Ky. We 
do this as follows: number the vertices with the elements of Zg. Let 
the edge {i, 7} be red if and only if i—j = +3 or i-j =4 (mod 8). 
One easily checks that this coloring does the job. 

Problem 3D. Use the same method to show that N(4, 4;2) = 18 
and that N(3,5;2) = 14. 


With a lot more work it has been shown that 


N(3,6;2) =18, N(3,7;2) = 23, N(3,8;2) =28, N(3,9;2) = 36, 
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N(4,5;2) = 25. 


No other values of N(p,q;2) are known. 

One of the interesting problems in this area that has seen virtu- 
ally no progress for 30 years is the asymptotic behavior of N(p, p; 2). 
We know by Theorem 3.4 that 


(3.5) N(p,p;2) < (? 4) < 2-2, 
p-1 

We now show that N(p,p;2) grows exponentially, using a method 
that is used quite often in combinatorics. It is often referred to 
as ‘probabilistic’ since it estimates the probability that a random 
coloring has a monochromatic K,. Consider a K,. There are 9(2) 
different ways of coloring the edges red or blue. Now fix a subgraph 
Kk,. There are 9(2)-@)+1 colorings for which that K, is monochro- 
matic. The number of colorings for which some K,, is monochro- 
matic is at most (°) times as large (because we may count some 
colorings more than once). If this number is less than the total 
number of colorings, then there exist colorings with no monochro- 
matic K,. Using the fact that (°) < n?/p!, we find that such a 
coloring certainly exists if n < 2?/? (unless p = 2). This proves the 
following theorem. 


Theorem 3.5. N(p,p;2) > 2?/?. 
From (3.5) and Theorem 3.5, we know that 


V2 < YN(p,p;2) <4 (p> 2). 


It would be very nice if one could show that this p-th root has a 
limit for p — oo. 

To give a considerable improvement of Theorem 3.5, we discuss a 
probabilistic method that is useful in many parts of combinatorics. 
We consider events A,, Ao,..., A, in a probability space. Denote 
by Pr[A;] the probability of the event A; and, as usual, let A; 
denote the complement of A;, i.e. non-occurrence of A;. We are 
interested in applications where the A; denote situations we do not 
wish to occur and where we would like to assert that there is a 
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positive probability that none of the events A; occurs. In some 
easy counting situations one can use 


S¢ Pr[A] <1 34; 49. 


i=1 


Cf. Problem 5E. However, in general, dependence among the unde- 
sirable events leads to multiple counting which results in a sum that 
is much larger than 1. Of course, if the events A; are independent, 
then it suffices that each has probability less than one to guarantee 
that non-occurrence of all of them has positive probability. The 
Lovasz Sieve handles situations where there is indeed some depen- 
dence but simultaneously there are many obviously independent 
combinations of events. 

We define what we shall call a dependency graph for the events 
Aj,..., An. This is a graph G on the set of indices {1,2,...,n} 
with the property that for every i the event A; is independent of 
every subset of {A; : {i,j} € E(G)}. Note that we require a lot 
more than that A; is independent of each of the A; in this subset. 


Theorem 3.6. Let G be some dependency graph for the events 
Aj,...,An. Suppose that Pr[Aj] < p,i=1,...,n and that every 
vertex in G has degree < d. If 4dp < 1, then NA; # 0. 


PROOF: We first show that for every subset {71,72,...,im} of the 
index set, 
——— 1 
The case m = 1 is trivial and for m = 2 we have 
= 1 1 
Pr(A,|Ay] < —2 


< ee, 
=n Ad Dd 
where for convenience of notation we have taken 1; = j and p; := 
Pr|A;]. We proceed by induction. 


Suppose that in G, 1 is adjacent to 2,3,...,q and not adjacent 
toq+1,...,m. We have 


Pr|Ai Ao ar Ag|Agi1 ee Am| 
Pr|A, cee Ag|Agut Stes An 


PHA GA aa = 
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The numerator is (by definition of G) at most 


=—_ 1 
Pr[Aj|Aqs1- scx Alga | — Pr{A\| < Ad’ 


Using the induction hypothesis, we find that the denominator is at 
least 


q 
— qg-1._1 
1— D PrlAilAga---Am] 21— “G2 5: 
This proves (3.6). We now have 
_ a n =a _——- 1 
»+- fin! = i ... Aj Ps 1——)" 
Pr[Ay... An] = || Pr{AilAi fs le ea 


i=1 


where we have used (3.6) for each term in the product. 


We apply this method to obtain a lower bound for N(p, p; 2). 
Theorem 3.7. N(p,p;2) > c-p-2°/?, where c is a constant. 


PROOF: Consider K,, and color the edges randomly with two col- 
ors. For each set S of k vertices let Ag be the event that the 
subgraph on S is colored monochromatically. We wish to assert 
that among the random colorings, there is at least one in which no 
monochromatic subgraph on k vertices occurs. We define a depen- 
dency graph by making S and T adjacent if and only if |SMT| > 2, 
i.e. the subgraphs on S and T have an edge in common. The degree 


d of G is clearly at most (5) (5) . The events Ag all have prob- 


ability 2'-(3). From Theorem 3.6, Stirling’s formula and a little 
manipulation, we find the result (and if we wish an estimate for c). 


We have given some examples of an area of combinatorics known 
as Ramsey theory. We just mention one more example, namely 
a theorem due to B. L. van der Waerden (1927). It states that 
there exists a number N(r) such that if N > N(r) and the integers 
from 1 to N are colored red or blue, then there is a monochromatic 
arithmetic progression of length r in the set. For a short (but not 
easy) proof see Graham and Rothschild (1974). A general reference 
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for this area is the book Ramsey Theory by R. L. Graham, B. L. 
Rothschild and J. L. Spencer (1980). 

An interesting application of Ramsey’s theorem is the following 
theorem due to Erdés and Szekeres (1935). 


Theorem 3.8. For a given n, there is an integer N(n) such that 
any collection of N > N(n) points in the plane, no three on a line, 
has a subset of n points forming a convex n-gon. 


PROOF: (i) First we observe that if we have n points, no three on a 
line, then they form a convex n-gon if and only if every quadrilateral 
formed by taking four of the points is convex. 

(ii) We now claim that N(n) = N(n,n;3) will do the job. Let 
S be a set of N(n) points. Number the points and then color tri- 
angles red, respectively blue, if the path from the smallest number 
via the middle one to the largest number is clockwise, respectively 
counterclockwise. There is an n-subset with all its triangles the 
same color, say red. We shall show that this set cannot contain the 
configuration of Fig. 3.2. 


PAN 


Figure 3.2 


Without loss of generality a < b < c. From triangle adc, we see 
that a << d<_.c. Then from triangle abd it follows that a < b < d. 
But then triangle bcd is blue, a contradiction. So all quadrilaterals 
formed from the n-subset are convex and by (i), we are done. 


Problem 3E. A tournament on n vertices is an orientation of K,,. 
A transitive tournament is a tournament for which the vertices can 
be numbered in such a way that (i, 7) is an edge if and only if i < 7. 
(a) Show that if k < log,n, every tournament on n vertices has 
a transitive subtournament on k vertices. 
(b) Show that if k > 1+ 2log,n, there exists a tournament on n 
vertices with no transitive subtournament on k vertices. 
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Problem 3F. Prove that for all r € N there is a minimal number 
N(r) with the following property. If n > N(r) and the integers in 
{1,2,...,n} are colored with r colors, then there are three elements 
x,y,z (not necessarily distinct) with the same color and x+y = z. 
(A result due to I. Schur.) Determine N (2). Show by an elementary 
argument that N(3) > 13. 


Problem 3G. Let m be given. Show that if n is large enough, 
every n x n (0,1)-matrix has a principal submatrix of size m, in 
which all the elements below the diagonal are the same, and all the 
elements above the diagonal are the same. 


Problem 3H. Show that if the edges of K,7 are colored with three 
colors, there must be a monochromatic triangle. 


Problem 3I. Let {1,a,a’,...,a'4} be the multiplicative group 
of F,,. Number the vertices of AKyg with the elements of Fig. We 
will color the edges of Ky with three colors. Edge {i,j} is to be 
colored with a color that depends only on v, where 7 — 7 = a”. Do 
this in such a way that there is no monochromatic triangle. (In the 
notation of Theorem 3.3, this problem and the previous one show 
that-JV (3,3; 332). = 17.) 


Problem 3J. The edges of K,, are colored red and blue in such a 
way that a red edge is in at most one red triangle. Show that there 
is a subgraph K;, with k > |V/2n]| that contains no red triangle. 


Problem 3K. Let G satisfy the conditions of Theorem 3.1. Show 
that by removing at most n/d edges we can find a subgraph G’ 
with chromatic number < d— 1. 


Notes. 

The four color conjecture was considered one of the most famous 
open problems in combinatorics until 1976. Its solution by Appel 
and Haken has caused much debate because the proof depends on 
an extensive computer analysis of many cases. The validity of the 
argument depends on one’s trust in computers and programs. (See 
Chapter 34 for two proofs of the Five Color Theorem.) 

The theorem by Brooks, Theorem 3.1, which he discovered while 
an undergraduate at Cambridge, is a typical example of the inge- 
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nious arguments that are quite often necessary in that part of graph 
theory where algebraic methods do not apply. 

F. P. Ramsey (1902-1928) died too young to produce the results 
that he probably would have. He was interested in decision pro- 
cedures for logical systems and, strangely enough, this led to the 
theorem that in turn led to so-called Ramsey theory. 

Theorem 3.5 is due to P. Erdés (1947). For more on probabilistic 
methods, see Erdés and Spencer (1974). 

Values of and estimates for the numbers N(p, q;2) can be found 
in Radziszowski (1999). The value of N(3,9;2) is from Grinstead 
and Roberts (1982). 

The proof of Theorem 3.6 is not the original proof by Erdés and 
Szekeres that one usually finds in books. This proof was produced 
by a student in Haifa (M. Tarsy) during an examination! He had 
missed the class in which the proof had been presented. See Lewin 
(1976). A proof in a similar vein was given by Johnson (1986). 


References. 

kK. Appel and W. Haken (1977), Every planar map is four-color- 
able, Illinois J. Math. 21, 429-567. 

R. L. Brooks (1941), On colouring the nodes of a network, Cam- 
bridge Philos. Soc. 37, 194-197. 

P. Erdés (1947), Some remarks on the theory of graphs, Bull. Amer. 
Math. Soc. 58, 292-294. 

P. Erdés and J. L. Spencer (1974), Probabilistic Methods in Com- 
binatorics, Academic Press. 

P. Erdés and G. Szekeres (1935), A combinatorial problem in ge- 
ometry, Compositio Math. 2, 463-470. 

R. L. Graham and B. L. Rothschild (1974), A short proof of van 
der Waerden’s theorem on arithmetic progressions, Proc. Amer. 
Math. Soc. 42, 385-386. 

R. L. Graham, B. L. Rothschild, and J. L. Spencer (1980), Ramsey 
Theory, Wiley. 

C. M. Grinstead and 8. M. Roberts (1982), On the Ramsey numbers 
R(3,8) and R(3,9), J. Combinatorial Theory (B) 33, 27-51. 

S. Johnson (1986), A new proof of the Erdés—Szekeres convex k-gon 
result, J. Combinatorial Theory (A) 42, 318-319. 


36 A Course in Combinatorics 


M. Lewin (1976), A new proof of a theorem of Erdés and Szekeres, 
The Math. Gazette 60, 136-138, 298. 

S. P. Radziszowski (1999), Small Ramsey Numbers, The Electronic 
Journal of Combinatorics 1 DS 1. 

F. P. Ramsey (1930), On a problem of formal logic, Proc. London 
Math. Soc. (2) 30, 264-286. 

B. L. van der Waerden (1927), Beweis einer Baudetschen Vermu- 
tung, Nieuw Archief voor Wiskunde 15, 212-216. 


A 


Turan’s theorem and 
extremal graphs 


As an introduction, we first ask the question how many edges a 
simple graph must have to guarantee that the graph contains a 
triangle. Since Kym and Km m+41 do not contain triangles, we see 
that if the graph has n vertices, then |n?/4| edges are not enough. 
We claim that if there are more edges, then the graph contains a 
triangle (W. Mantel, 1907). The following proof is surprising. Let 
G have n vertices, numbered from 1 to n, and no triangles. We 
give vertex i a weight z; > 0 such that 5>z; = 1 and we wish 
to maximize S := 5° 2,z;, where the sum is taken over all edges 
{i,j}. Suppose that vertex k and vertex | are not joined. Let the 
neighbors of k have total weight x, and those of / total weight y, 
where x > y. Since (z, + 6)u + (4 — €)y > zeu + z1y, we do not 
decrease the value of S if we shift some of the weight of vertex 
to the vertex k. It follows that S is maximal if all of the weight 
is concentrated on some complete subgraph of G, i.e. on one edge! 
Therefore S < 7 On the other hand, taking all z; equal to n7! 
would yield a value of n~?|E| for S. Therefore |E| < $n’. 

Note that Ramsey’s theorem states that if a graph on n vertices 
has n > N(p,q;2), then the graph either has a complete subgraph 
on p vertices or a set of g vertices with no edges between them 
(called an independent set). We now ask the question whether some 
condition on the number of edges guarantees a K,, as a subgraph. 
We saw above what the answer is for p = 3. We also already have 
an idea of how to avoid a K,. Divide the vertices into p— 1 subsets 
S},.-.,Sp-1 Of almost equal size, i.e. r subsets of size t + 1 and 
p—1-r subsets of size t, wheren = t(p—1)+r,1<r<p-1l. 
Within each S; there are no edges but every vertex in 5; is joined to 
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every vertex in S; ifi #7. (This is a complete multipartite graph.) 
The number of edges is 
P22 sy, HOH Lae) 
M(n, p) := = 
oP) =o -1)” — 2-1 
Theorem 4.1. (Turan, 1941) If a simple graph on n vertices has 
more than M(n,p) edges, then it contains a K, as a subgraph. 


ProoF: The proof is by induction on t. If t = 0, the theorem 
is obvious. Consider a graph G with n vertices, no K,, and the 
maximum number of edges subject to those properties. Clearly G 
contains a K,-1 (otherwise adding an edge would not produce a 
K,), say H. Each of the remaining vertices is joined to at most 
p—2 vertices of H. The remaining n—p+1 vertices do not contain 
a KX, as subgraph. Since n—p+1 = (t—1)(p—1)+r, we can apply 
the induction hypothesis to this set of points. So the number of 
edges of G is at most 


M(n~p+1,p) + (n~p+1)p-2)+ (? 5") 


and this number is equal to M(n, p). 


Remark. The argument that was used for Mantel’s theorem would 


. . —2 9) 
show that if there is no K,, then |E| < pay” i 


Problem 4A. Let G be a simple graph with 10 vertices and 26 
edges. Show that G has at least 5 triangles. Can equality occur? 


Turan’s paper on graph theory that contains Theorem 4.1 is con- 
sidered the starting point of what is now known as extremal graph 
theory—see Bollobas (1978). A simple instance of an extremal 
problem will ask for the maximum number of edges a graph with a 
certain property may have. The graphs whose number of edges is 
maximum are called the extremal graphs with respect to the prop- 
erty. 

The extremal graphs for Turdn’s problem are only the complete 
multipartite graphs described above. This follows from an analysis 
of the proof of Theorem 4.1; we ask the reader to do this at least 
in the case p = 3 in the problem below. 
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Problem 4B. Show that a simple graph on n vertices with |n?/4| 
edges and no triangles is a complete bipartite graph K;,;, if n = 2k, 
or Kypper it n= 2k+ 1. 


Problem 4C. If asimple graph on n vertices has e edges, then it 


has at least = (4e — n”) triangles. 


The girth of a graph G is the size of a smallest polygon P, in G. 
(A forest has infinite girth.) By definition, a graph is simple if and 
only if it has girth > 3. By Mantel’s theorem, a graph with more 
than n7/4 edges has girth < 3. 


Theorem 4.2. Ifa graph G on n vertices has more than $nv/ n—1 
edges, then G has girth < 4. That is, G is not simple or contains a 
P3 or a P, (a triangle or a quadrilateral). 


PROOF: Suppose G has girth > 5. Let yj, yo,...,yq be the vertices 


adjacent to a vertex x, where d := deg(x). No two of these are 
adjacent since G has no triangles. Moreover, no vertex (other than 
x) can be adjacent to more than one of yi,..., ya since there are no 


quadrilaterals in G. Thus (deg(y,) —1) +---+(deg(ya) — 1) +(d+1) 
cannot exceed the total number n of vertices. That is, 


S- deg(y) <n—-1. 


y adjacent to x 


Then 


n(n—1)>S>  S> deg(y) = S> deg(y)? 


x y adjacent to x 
1 : 1 
32 ps seat = -(2|E(G)|)* 
y 


The number 5ny/ n — 1 in Theorem 4.2 is only a bound—it is not 
the exact answer for all n. Determination of the extremal graphs 
for this problem (maximum number of edges subject to girth > 5) 
for all values of n is impossibly difficult; a determination of the 
graphs for which equality holds has, however, been almost possible. 
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Perhaps surprisingly, there are at most four graphs with n > 2 
vertices, girth > 5, and $n/n—T1 edges: The pentagon (n = 5), 
the Petersen graph (n = 10), one with n = 50, and possibly one 
with n = 3250. See the notes and Chapter 21. 


Problem 4D. Suppose G is regular of degree r and has girth g 
or greater. Find a lower bound for |V(G)|. (Consider the cases g 
even and odd separately.) 


It is not so interesting to ask how many edges are required to 
force a Hamiltonian circuit. But we can ask what bound on the 
minimum degree will do the job. 


Theorem 4.3. If a simple graph G on n vertices has all vertices 
of degree at least n/2, then it contains a P,, as a subgraph, i.e. it 
has a Hamiltonian circuit. 


PROOF: Suppose the theorem is not true and let G be a graph 
satisfying the hypothesis for some n but having no Hamiltonian 
circuits. We may take G to be such a counterexample with the 
maximum number of edges; then the addition of any edge to G (i.e. 
joining two nonadjacent vertices by an edge) creates a Hamiltonian 
circuit. 

Let y and z be nonadjacent vertices. Since adding {y, z} creates 
a Hamiltonian circuit, there exists a simple path from y to z with 
vertex terms, y = 21, %9,...,2%2pn = 2, say. The sets 


{i:y is adjacent to x41} 


and 

{i : z is adjacent to x;} 
each have cardinality > n/2 and are contained in {1,2,3,...,n—1}, 
so they must meet; let 79 belong to both. Then 


Y= %,72,..-. Vig, ~ = Ln, Un-1,--- » Vigtl, V1 =Y 


is the vertex sequence of a simple closed path of length n in G, 
contradicting our choice of G as a counterexample. 


Theorem 4.3 is due to G. A. Dirac and is best possible at least 
in the sense that it does not remain true if we replace n/2 by 
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(n —1)/2. For example, the complete bipartite graphs Ky,+1 have 
no Hamiltonian circuits. But it does admit improvements and 
generalizations—see e.g. Lovdsz (1979), Problem 10.21. 


Problem 4E. A 3x3x3 cube of cheese is divided into 27 1x1x1 
small cubes. A mouse eats one small cube each day and an adjacent 
small cube (sharing a face) the next day. Can the mouse eat the 
center small cube on the last day? 


Problem 4F. Let G be a simple graph with n vertices. Prove: If 
each vertex of G has degree > (n+ 1)/2, then for any edge e, there 
exists a Hamiltonian circuit of G that passes through e. 


Problem 4G. Prove the remark following Theorem 4.1. 


Problem 4H. Show that a graph on n vertices that does not 
contain a circuit on four vertices has at most 4(1+./4n — 3) edges. 


Notes. 

P. Turan (1910-1976), one of the famous Hungarian mathemati- 
cians of the 20th century, is best known for his work in analytic 
number theory and real and complex analysis. 

For every r > 2 and g > 2, there exists a graph that is regular of 
degree r and has girth > g. See Lovasz (1979), Problem 10.12. 

Analysis of the proof of Theorem 4.2 shows that a graph with 
n > 2 vertices, girth > 5, and $ny/n — 1 edges is regular of degree 
k := Vn — 1 and also that any two vertices are joined by a (unique) 
path of length 2 if they are not adjacent. With the notation of 
Chapter 21, such a graph is an srg(n,k,0,1) and the methods of 
that chapter show that k = 2,3,7, or 57. This was first shown 
in Hoffman and Singleton (1960) where in addition an example 
with k = 7 and n = 50 was described (that is now known as the 
Hoffman-Singleton graph). It is not known at this time whether 
there exists an srg(3250, 57,0, 1). 


References. 

B. Bollobas (1978), Extremal Graph Theory, Academic Press. 

A. J. Hoffman and R. R. Singleton (1960), On Moore graphs with 
diameters two and three, JBM J. Res. Develop. 4, 497-504. 
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Systems of distinct 
representatives 


We first give two different formulations of a theorem known as 
P. Hall’s marriage theorem. We give a constructive proof and an 
enumerative one. If A is a subset of the vertices of a graph, then 
denote by ['(A) the set U,-4 F(a). Consider a bipartite graph G 
with vertex set X UY (every edge has one endpoint in X and one 
in Y). A matching in G is a subset FE, of the edge set such that 
no vertex is incident with more than one edge in E;. A complete 
matching from X to Y is a matching such that every vertex in X is 
incident with an edge in Fy. If the vertices of X and Y are thought 
of as boys and girls, respectively, or vice versa, and an edge is 
present when the persons corresponding to its ends have amicable 
feelings towards one another, then a complete matching represents 
a possible assignment of marriage partners to the persons in X. 


Theorem 5.1. A necessary and sufficient condition for there to 
be a complete matching from X to Y in G is that |[(A)| > |A| for 
every AC X. 


PROOF: (i) It is obvious that the condition is necessary. 

(ii) Assume that |['(A)| > |A| for every A C X. Let |X| =n, 
m <n, and suppose we have a matching M with m edges. We shall 
show that a larger matching exists. (We mean larger in cardinality; 
we may not be able to find a complete matching containing these 
particular m edges.) 

Call the edges of M red and all other edges blue. Let x7 € X 
be a vertex not incident with an edge of the matching. We claim 
that there exists a simple path (of odd length) starting with xp) and 
a blue edge, using red and blue edges alternately, and terminating 
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with a blue edge and a vertex y not incident with an edge of the 
matching. If we find such a path p, we are done because we obtain 
a matching with m+ 1 edges by deleting the red edges of p from 
M and replacing them with the blue edges of p. In other words, 
we switch the colors on the edges of p. 

Since |['({ao})| > 1, there is a vertex y; adjacent to xp (obviously 
by a blue edge since xp is not incident with any red edges). If y, 
is also not incident with any red edges, we have the required path 
(of length one); if y; is incident with a red edge, let x; be the other 
end of that red edge. Recursively, define 79, 71,... and yy, y2,... as 
follows. If %,271,...,@, and y1,..., yx have been defined, then since 
IC({xo,21,.--,2x})| > &+1, there exists a vertex y,+1, distinct from 
Yi,--+-,Yk, that is adjacent to at least one vertex in {x9,%1,..., rx}. 
If yz41 is not incident with a red edge, stop; otherwise, let 7,41 be 
the other end of that red edge. 

When the procedure terminates, we construct the path p by start- 
ing with y;,+; and the blue edge joining it to, say, z;,, 74; < k+1. 
Then add the red edge {x;,, y;,}. By construction, y;, is joined by 
an edge (necessarily blue) to some 2;,, 12 < i;. Then add the red 
edge {2;,, yi, }. Continue in this way until xo is reached. 


Problem 5A. A perfect matching in a graph G (not necessarily 
bipartite) is a matching so that each vertex of G is incident with 
one edge of the matching. (i) Show that a finite regular bipartite 
graph (regular of degree d > 0) has a perfect matching. (ii) Find a 
trivalent (regular of degree 3) simple graph which does not have a 
perfect matching. (iii) Suppose G is bipartite with vertices X UY 
(every edge having one end in X and one in Y). Further assume 
that every vertex in X has the same degree s > 0 and every vertex 
in Y has the same degree t. (This condition is called semiregular- 
ity.) Prove: If |X| < |Y| (equivalently, if s > t), then there is a 
complete matching M of X into Y. 


Example 5.1. A parlor trick involving a standard deck of 52 cards 
is as follows. You are dealt five cards at random. You keep one and 
put the other four (in a specific order) into an envelope which is 
taken to your partner in another room. Your partner looks at these 
and announces the name of the fifth card, that you had retained. 
Using the values of suits and ranks, it is possible to think of clever 
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or relatively simple ways to determine which card to keep and which 
to pass, and for the partner to determine the retained card; see 
the notes. Ignoring complexity of solution, however, the relevant 
general mathematical problem, stated for a deck of N cards, is 
does there exist an injective mapping f from the set X of the (2 ) 
5-element subsets of the N cards into the set Y of N(N — 1)(N — 
2)(N —3) ordered 4-tuples of distinct cards, subject to the condition 
that if f(S) = (c1, ca, ¢3,c4), then {c1, c2,¢3,c4} C S. 

In terms of matchings, we consider the bipartite graph G whose 
vertices are X UY as defined above, and where there is an edge 
joining S € X to {c1,c2,c3,c4} € Y exactly when {cj, co, c3,c4} C 
S. We require a complete matching M of X into Y. The reader 
may check that G is semiregular and that |X| < |Y| if and only if 
N < 124. So by Problem 5A(iii), for N < 124, there exists such a 
matching. 


We now reformulate Theorem 5.1 in terms of sets and not only 
prove the theorem but also give a lower bound for the number of 
matchings. We consider subsets Ag, Aj,...,An—1 of a finite set S. 
We shall say that this collection has property H (Hall’s condition) 
if (for all &) the union of any k-tuple of subsets A; has at least k 
elements. If the union of some k-tuple of subsets contains exactly 
k elements (0 < k < n), then we call this k-tuple a critical block. 

We define a system of distinct representatives (SDR) of the sets 
Ao,...,An-1 to be a sequence of n distinct elements ag,...,@n—1 
with a; € Aj, O<i<n-1. 

Let mp < my, < +--+ << my_1. We define 


n-1 


Eig itn) [[o —t)x, 
i=0 
where (a), := max{1,a}. 
From now on, we assume that the sequence m; := |A;| is nonde- 
creasing. 
For the proof of the main theorem, we need a lemma. 


Lemma 5.2. For n > 1, let f, : Z"” — N be defined by 


fr(ao, Ql,+++; Gn) = Fi(mo, My, ++. ,™Mn-1) 
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if (mMo,.--,;Mn—1) is a nondecreasing rearrangement of the n-tuple 
(ag,---,@n—1). Then f,, is nondecreasing with respect to each of the 
variables a;. 


PrRoor: Let 


mo S++ S Mp1 Sai = Me S Me S °°: 
<m <S Mig S00 S Mp1 


be a nondecreasing rearrangement of (ao,...,@n—1). If a; > a; and 


ify Stes Nig = i Se SES 0, So 8 eS iA 


is a nondecreasing rearrangement of (do,...,@i—1, @), Gi41,.--, @n—1) 
then 
/ 
Fn(ao, +++) Qj-1,4;, Qi+1,---, Gi) ne 
fr(ao, eee | G1.) 


Gti = Be EDs. 


(a; — k). (m; — 1). joktl 


(my41 2); 
(mj — 7). 


and this is > 1 since a; < mpi, af > mi, and m4, > m,; for 
jHk+1,...,J-1. 


We now come to the second form of Hall’s theorem. We denote 
by N(Ap,..., An—1) the number of SDRs of (Ao,..., An-1). 


Theorem 5.3. Let (Ao,...,An—1) be a sequence of subsets of a set 
S. Let m; := |A;| (@=0,...,2—1) and let mp < my < +--+ < my_1. 
If the sequence has property H, then 


N(Ao, sey An-1) > F,(mo, Soe Mi A 


PROOF: The proof is by induction. Clearly the theorem is true for 
n= 1. We distinguish two cases. 

Case 1. There is no critical block. In this case, we choose any 
element a of Ao as its representative and then remove a from all the 
other sets. This yields sets, that we call Aj(a),...,An—i(a), and 
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for these sets property H still holds. By the induction hypothesis 
and by the lemma, we find 


ac Ag 
= >». Fravln + 1, vee My-1 — 1) 
ace Ao 
am Mo fn—-1(m1 a 1, see Mn-1 1) 
= Fi(mo, My1,.-- Mine). 
Case 2. There is a critical block (A,,,...,Av,_,) with v9 < +--+ < 
y,-1 and 0 < k < n. In this case, we delete all elements of A,, U 
---UA,,_, from all the other sets A; which produces A),,,...,Aj,,_ 45 
where {i,...,Vg—1, 0,---, Mi-1} = {0,1,...,.n-—1}, A+l=n. 
Now both (A,,,---,;A:,_,) and (A,,,,---,Aj,,_,) satisfy property 


H and SDRs of the two sequences are always disjoint. Hence by 
the induction hypothesis and the lemma, we have 


o.L 

ce eget lH Ay Apa ices yAn aN (Ais ae A) 
PR ienase teas 1 Aa exes A |) 
ST Miyes ox 2 Mp: 3 ITM ng — Rie a) 
Pf ( Gig fess) Figs A Tg 


Now we remark that 

My. 1S |\Ay, Ui W Ay, |=; 
and therefore we have 

(mp —1T),=1 Uh ST Spi 


and 
acre —k—- ae =] if Li < VR-1- 
This implies that 


TRGgy oes a) = I] (m; — t)x, 


IN Mig = hy yy 4g =k) = I] (i =F) 


Ve—1<j<n 
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i.e. the product (5.1) is equal to Fi,(mo,...,7™n-1), which proves 
the theorem. 


Problem 5B. Show that Theorem 5.3 gives the best lower bound 
for the number of SDRs of the sets A; that only involves the num- 
bers |A;|. 


We now come to a theorem known as Konig’s theorem. It is 
equivalent (whatever that means) to Hall’s theorem. In the theo- 
rem, A is a (0,1)-matrix with entries a;;. By a line, we mean a row 
or a column of A. 


Theorem 5.4. The minimum number of lines of A that contain 
all the 1’s of A is equal to the maximum number of 1’s in A, no 
two on a line. 


PROOF: Let m be the minimum number of lines of A containing all 
the 1’s of A and let M be the maximum number of 1’s, no two on 
a line. Clearly m > M. Let the minimum covering by lines consist 
of r rows and s columns (r+ s = m). Without loss of generality, 
these are the first r rows and the first s columns. We now define 
sets Aj, 1 <i<r, by Aj := {j > 8: a, = 1}. If some k-tuple of 
the A;’s contained less than k elements, then we could replace the 
corresponding k rows by k — 1 columns, still covering all the 1’s. 
Since this is impossible, we see that the A;’s satisfy property H. So 
the A;’s have an SDR. This means that there are r 1’s, no two ona 
line, in the first r rows and not in the first s columns. By the same 
argument there are s 1’s, no two on a line, in the first s columns 
and not in the first r rows. This shows that M > r+s =m and 
we are done. 


The following theorem of G. Birkhoff is an application of Hall’s 
theorem. 


Theorem 5.5. Let A = (a;;) be ann Xx n matrix with nonnegative 
integers as entries, such that every row and column of A has sum 
1. Then A is the sum of | permutation matrices. 


PROOF: Define A;, 1 <7 < n, by A; := {j : aj; > O}. For any 
k-tuple of the A;’s, the sum of the corresponding rows of A is kl. 
Since every column of A has sum J, the nonzero entries in the 
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chosen k rows must be in at least & columns. Hence the A,’s satisfy 
property H. An SDR of the A;’s corresponds to a permutation 
matrix P = (p;;) such that a;; > 0 if p;; = 1. The theorem now 
follows by induction on J. 


Problem 5C. In the hypothesis of Theorem 5.5, we replace ‘inte- 
gers’ by ‘reals’. Show that in this case, A is a nonnegative linear 
combination of permutation matrices. (Equivalently, every dou- 
bly stochastic matrix—see Chapter 11—is a convex combination of 
permutation matrices. ) 


Problem 5D. Let S be the set {1,2,...,mmn}. We partition S into 
m sets Aj,...,Am of size n. Let a second partitioning into m sets 
of size n be By,..., By». Show that the sets A; can be renumbered 
in such a way that A; B; £ 0. 


Problem 5E. Let A; = {i-1,1,i+1}N{1,2,...,n},2=1,2,...,n. 
Let S;,, denote the number of SDR’s of the collection {Aj,..., An}. 
Determine S,, and lim,-.. gue 

Let G be a bipartite graph, finite or infinite; say the vertex set 
is partitioned into sets X,Y of vertices so that every edge of G has 
one end in X and one end in Y. We say that a matching M in G 
covers a subset S of the vertices when every vertex in S is incident 
with one of the edges in M. 


Theorem 5.6. If there exists a matching M, that covers a subset 
Xo of X and there exists a matching M, that covers a subset Yo of 
Y, then there exists a matching M3 that covers Xo U Yo. 


PROOF: Think of the edges of M, as ‘red edges’ and the edges of 
Mp as ‘blue edges’. If an edge belongs to both M, and Mg, it is 
‘purple’. 

A connected graph all of whose vertices have degree at most 
two is easily seen to be one of: a finite path-graph (allowing the 
trivial case of length 0, when the component has one vertex and 
no edges), a finite polygon, an infinte ‘one-sided’ path-graph (with 
one monovalent vertex), or an infinite ‘two-sided’ path-graph. The 
graph H whose vertices are those of G and whose edges M, U Mp 
has the property that every vertex has degree at most two, so its 
connected components are of the types enumerated above. The 
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edges of any of these components, other than the graphs consisting 
of a purple edge and its two ends, are alternately colored red and 
blue; in particular, all polygons have even length. Every vertex of 
Xo U Yo is in one of these nontrivial components. 

For the matching M3, we will take all purple edges, and either 
all red edges or all blue edges from every other component of H. 
From the cycles and infinite two-sided paths, it doesn’t matter; 
take all red or all blue edges, and they will cover all vertices of the 
component. From the paths of odd length and infinite one-sided 
paths, take all red or blue edges depending on whether the first 
edge is red or blue (for a path of odd length, the first and last edge 
have the same color, so it doesn’t matter what side you start on). 
Again, the selected edges will cover all vertices of the component. 

We have to think just a tiny bit harder for a component that is a 
finite path P of even length, with vertices vp, v1,..., Uz, Say. There 
is an odd number of vertices and they alternate between X and Y, 
SO Up and vu, are both in X or both in Y. If they are both in X, 
take all red edges of P (those of MM) and put them in Ms; if they 
are both in Y, take all blue edges of P and put them in M3. Only 
one end of the path is not covered by the chosen edges. 

Consider the case that both vg, uv, € X (the case when they are 
in Y is completely analogous). If the first edge of P is red, then the 
last is blue and it follows that vu; ¢ Xo since no edge of M; covers 
vp. Thus the red edges of P still cover all vertices of Xo and Yo that 
were in P. Similarly, if the first edge of P is blue, then vp € Xo and 
the red edges of P still cover all vertices of Xp and Yo that were in 
P, 


For the case Xp = X, Yo = Y, and when the graph G is complete 
bipartite, matchings that cover Xo or Yo correspond to, or can be 
interpreted as, injective mappings X — Y or Y — X, respectively. 
Theorem 5.6 says: 


Corollary. If X and Y are sets and there exist injective map- 
pings f : X — Y andg: Y — X, then there exists a bijective 
mapping from X to Y, or from Y to X, i.e. there is a one-to-one 
correspondence between the two sets. 


In terms of ‘cardinality’ of sets, this says that if |X| < |Y| and 
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lY| < |X], then |X| = |Y|. This is the Schréder-Bernstein Theo- 
rem; see Section 22 of P. R. Halmos (1974). It is trivial for finite 
sets, of course. 


Problem 5F. Let A, Ao,...,A, be finite sets. Show that if 


|A;N A,| 
ees 
2, |Ail - [Ay] 


1<i<j<n 
then the sets A;, Ao,... A, have a system of distinct representatives. 


Problem 5G. (i) From Problem 5A we know that a bipartite 
graph on 2n vertices that is regular of degree 3 has a perfect match- 
ing. How many different perfect matchings are there if n = 4? (ii) 
The same question for a bipartite graph on 10 vertices that is reg- 
ular of degree 4. 


Notes. 

Philip Hall published his result in 1935 (with a rather difficult 
proof). The proof that we gave is a generalization of ideas of Hal- 
mos and Vaughan, Rado, and M. Hall. The proof is due to Ostrand 
(1970) and Hautus and Van Lint (1972). See Van Lint (1974). The 
problem of complete matchings is often referred to as the marriage 
problem. 

D. Konig (1884-1944) was professor at Budapest. He wrote the 
first comprehensive treatise on graph theory (Theorie der endlichen 
und unendlichen Graphen, 1936). Konig (1916) contains the first 
proof of (one of the theorems called) Kénig’s theorem. 

Just before Theorem 5.4, we referred to the ‘equivalence’of these 
theorems. This expression is often used when each of two theorems 
is more or less an immediate consequence of the other. 

The theorem by Birkhoff (1946), i.e. Theorem 5.5, is extremely 
useful and will be applied a number of times in later chapters. 

For the card problem mentioned in Example 5.1, here is one so- 
lution. In a set of five cards, some suit must be represented at 
least twice. The first card you pass to your partner should be 
the ‘larger’ of two cards of the same suit, and you will retain the 
‘smaller’, where we think of the ranks 2,3,...,10, J,Q, K, A as ar- 
ranged clockwise on a circle (modulo 13), and by ‘smaller’ we mean 
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the card from which we must travel the least distance clockwise 
to get to the other. For example, if S = {3@,Q¢, 6&, 30, 7a}, 
you pass either the 7@ or 34 to your partner. This already tells 
your partner the suit of the retained card and limits that card to 
six possibilities in that suit. To determine how far to count back 
(counterclockwise) from the rank of the first card, you and your 
partner use the order of the remaining three cards (the 52 cards 
are ordered lexicographically), and agree on some correspondence 
between the six permutations of three objects and the integers from 
1 to 6. 


References. 

G. Birkhoff (1946), Tres observaciones sobre el algebra lineal, Univ. 
Nac. Tucuman, Rev. Ser. A, 5, 147-151. 

P. Hall (1935), On representatives of subsets, J. London Math. Soc. 
10, 26-30. 

P. R. Halmos (1974), Naive Set Theory, Springer-Verlag. 

D. Konig (1916), Uber Graphen und ihre Anwendung auf Determi- 
nantentheorie und Mengenlehre, Math. Annalen 77, 453-465. 

J. H. van Lint (1974), Combinatorial Theory Seminar Eindhoven 
University of Technology, Lecture Notes in Mathematics 382, 
Springer-Verlag. 

P. Ostrand (1970), Systems of distinct representatives, J. of Math. 
Analysis and Applic. 32, 1-4. 


6 


Dilworth’s theorem and 
extremal set theory 


A partially ordered set (also poset) is a set S with a binary relation 
< (sometimes C is used) such that: 

(i) a <a for all a € S (reflexivity), 

(ii) if a < band b<c then a < c (transitivity), 

(iii) if a <b and b< a then a= b (antisymmetry). 

If for any a and 6 in S, either a < b or b < a, then the partial 
order is called a total order, or a linear order. Ifa < banda b, 
then we also write a < b. Examples of posets include the integers 
with the usual order or the subsets of a set, ordered by inclusion. 
If a subset of S is totally ordered, it is called a chain. An antichain 
is a set of elements that are pairwise incomparable. 

The following theorem is due to R. Dilworth (1950). This proof 
is due to H. Tverberg (1967). 


Theorem 6.1. Let P be a partially ordered finite set. The min- 
imum number m of disjoint chains which together contain all ele- 
ments of P is equal to the maximum number M of elements in an 
antichain of P. 


PROOF: (i) It is trivial that m > M. 

(ii) We use induction on |P|. If |P| = 0, there is nothing to 
prove. Let C be a maximal chain in P. If every antichain in P\C 
contains at most MM — 1 elements, we are done. So assume that 
{a1,...,@,} is an antichain in P\C. Now define S" := {x € P: 
4;[x < a;]}, and define St analogously. Since C is a maximal chain, 
the largest element in C is not in S~ and hence by the induction 
hypothesis, the theorem holds for S~. Hence S~ is the union of 
disjoint chains S| ,...,5),;, where a; € S;. Suppose x € S; and 
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x > a;. Since there is a j with x < a;, we would have a; < aj, 
a contradiction. This shows that a; is the maximal element of the 
chain S;,i=1,...,m. We do the same for S*. By combining the 
chains the theorem follows. 


A ‘dual’ to Dilworth’s theorem was given by Mirsky (1971). 


Theorem 6.2. Let P be a partially ordered set. If P possesses no 
chain of m+ 1 elements, then P is the union of m antichains. 


PROOF: For m = 1 the theorem is trivial. Let m > 2 and assume 
that the theorem is true for m-—1. Let P bea partially ordered set 
that has no chain of m-+1 elements. Let M be the set of maximal 
elements of P. M is an antichain. Suppose x71 < rg < +--+: < @m 
were a chain in P\M. Then this would also be a maximal chain in 
P and hence we would have x, € M, a contradiction. Hence P\M 
has no chain of m elements. By the induction hypothesis, P\M is 
the union of m — 1 antichains. This proves the theorem. 


The following famous theorem due to Sperner (1928) is of a sim- 
ilar nature. This proof is due to Lubell (1966). 


Theorem 6.3. If Aj, Ao,..., Am are subsets of N := {1,2,...,n} 
such that A; is not a subset of A; ifi Aj, thenm < (ie) 


PROOF: Consider the poset of subsets of N. A:= {Aj,..., Am} is 
an antichain in this poset. 

A maximal chain C in this poset will consist of one subset of each 
cardinality 0,1,...,, and is obtained by starting with the empty 
set, then any singleton set (n choices), then any 2-subset containing 
the singleton (n — 1 choices), then any 3-subset containing the 2- 
subset (n — 2 choices), etc. Thus there are n! maximal chains. 
Similarly, there are exactly k!(n—k)! maximal chains which contain 
a given k-subset A of N. 

Now count the number of ordered pairs (A,C) such that A € A, 
C is a maximal chain, and A € C. Since each maximal chain C 
contains at most one member of an antichain, this number is at 
most n!. If we let a; denote the number of sets A € A with |A| =k, 
then this number is }>;_) a¢k!(n — k)!. Thus 


So axkl(n —k)!<n!, or equivalently, S- ae 
k=0 


=i 
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n 


;) is maximal for k = [n/2] and SSa, = m, the result 


Since ( 
follows. 


Equality holds in Theorem 6.3 if we take all |n/2|-subsets of N 
as the antichain. 

We now consider the poset B,, (with 2" elements) of the subsets 
of the n-set N, ordered by inclusion. The set of 7-subsets of N is 
denoted by A;. We define a symmetric chain in B,, to be a sequence 
Py, Pesi,---,Pn—,~ of vertices such that P; € A; and P; C P;., for 
i= k,k+1,...,n -—k—1. We describe an algorithm due to De 
Bruijn, Van Ebbenhorst Tengbergen and Kruyswijk (1949), that 
splits B,, into (disjoint) symmetric chains. 

Algorithm: Start with B,. Proceed by induction. If B, has 
been split into symmetric chains, then for each such symmetric 
chain P,,...,P,—, define two symmetric chains in B,,,, namely 
Prat, sey Paap and Pr, P,U{n+1}, PyiiU{n+1}, wey P,-pU{nt+1}. 

It is easy to see that this algorithm does what we claim. Fur- 
thermore it provides a natural matching between k-subsets and 
(n — k)-subsets in B,, (cf. Theorem 5.1). Also, see Problem 6D 
below. 


Problem 6A. Let aj, a2,...,@,)24,; be a permutation of the inte- 
gers 1,2,...,n?+1. Show that Dilworth’s theorem implies that the 
sequence has a subsequence of length n + 1 that is monotone. 


A nice direct proof of the assertion of Problem 6A is as follows. 
Suppose there is no increasing subsequence of n + 1 terms. De- 
fine b; to be the length of the longest increasing subsequence that 
starts with the term a;. Then by the pigeonhole principle, there are 
at least n+ 1 terms in the b;-sequence that have the same value. 
Since 7 < j and b; = b; imply that a; > a;, we have a decreasing 
subsequence of n + 1 terms. 

To show a connection between Chapters 5 and 6, we now prove 
that Theorem 5.1 immediately follows from Theorem 6.1. We 
consider the bipartite graph G of Theorem 5.1. Let |X| = n, 
[Y| = n' > n. We introduce a partial order by defining 2; < y; 
if and only if there is an edge from vertex x; to vertex y;. Suppose 
that the largest antichain contains s elements. Let this antichain be 
{€1,.-.,%n,yi,---, ys}, where h+k =. Since [({z1,...,2,}) C 
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Y\{y1,---, yr}, we have h < n’—k. Hence s < n’. The partially 
ordered set is the union of s disjoint chains. This will consist of 
a matching of size a, the remaining n — a elements of X, and the 
remaining n’ — a elements of Y. Therefore n+n'-—a=s <1’, ie. 
a >n, which means that we have a complete matching. 

Theorem 6.3 is a (fairly easy) example of an area known as ez- 
tremal set theory in which the problems are often quite difficult. 
We first give one more example as an easy exercise. 


Problem 6B. Let the sets A;, 1 <i < k, be distinct subsets of 
{1,2,...,n}. Suppose A; A; # @ for all i and j. Show that 
k <2"! and give an example where equality holds. 


We now give one more example of the method that we used to 
prove Sperner’s theorem. We prove the so-called Erdés—Ko—Rado 
theorem (1961). 


Theorem 6.4. Let A= {Aj,..., Am} bea collection of m distinct 
k-subsets of {1,2,...,n}, where k < n/2, with the property that 
any two of the subsets have a nonempty intersection. Then m < 
(21): 

PROOF: Place the integers 1 to n on a circle and consider the family 
F := {F\,...,F,} of all consecutive k-tuples on the circle, i.e. F; 
denotes {7,i+1,...,i+k—1} where the integers should be taken 
mod n. We observe that |AM F| < k because if some F; equals Aj, 
then at most one of the sets {2,J+1,...,1+k-—1}, {l—k,...,1-—1} 
(i <1 <i+k) isin A. The same assertion holds for the collection 
F* obtained from F by applying a permutation 7 to {1,...,n}. 
Therefore 

Di= So AN F"| Skea. 
TES ny, 
We now count this sum by fixing A; € A, F; € F and observing 


that there are k!(n —k)! permutations 7 such that F7 = A;. Hence 
“v=m-n-kl(n—k)!. This proves the theorem. 


By a slight modification of the proof, one can show that the 
theorem also holds if the sets in A are assumed to have size at 
most k and they form an antichain. However we shall give a proof 
using Theorem 5.1. 
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Theorem 6.5. Let A= {Aj,...,Am} bea collection of m subsets 
of N := {1,2,...,n} such that A; Z A; and A;N A; 4 0 ift F j 


and |A;| <k < n/2 for alli. Then m < () 


Proor: (i) If all the subsets have size k, then we are done by 
Theorem 6.4. 

(ii) Let A,,...,A, be the subsets with the smallest cardinality, 
say | < }—1. Consider all the (+ 1)-subsets B; of N that contain 
one or more of the sets A;, 1 <i < s. Clearly none of these is in A. 
Each of the sets A;, 1 <i < s, is in exactly n — 1 of the B;’s and 
each B; contains at most 1+ 1<n-—1 of the A;’s. So by Theorem 
5.1, we can pick s distinct sets, say B,,...,B,, such that A; C B;. 
If we replace Aj,...,A; by B,,...,B;, then the new collection A’ 
satisfies the conditions of the theorem and the subsets of smallest 
cardinality now all have size > 1. By induction, we can reduce to 
case (i). 


By replacing the counting argument of the proof of Theorem 6.4 
by an argument in which the subsets are counted with weights, we 
can prove the following generalization due to B. Bollobas (1973). 


Theorem 6.6. Let A= {Aj,..., Am} bea collection of m distinct 
subsets of {1,2,...,n}, where |A;| <n/2 fori =1,...,m, with the 
property that any two of the subsets have a nonempty intersection. 


Then 
ae 
n-1 ae 
i=l (Ay)-1) 
PROOF: Let a be a permutation of 1,2,..., placed on a circle 


and let us say that A; € 7 if the elements of A; occur consecutively 
somewhere on that circle. By the same argument as in the proof 
of Theorem 6.4 we see that if A; € 7, then A; € a for at most |Aj| 
values of 7. 
Now define ' 
f@a) _ { TA)? if A; € 7 


0, otherwise. 


By the argument above }0,-5 oj, f(7,7) < n!. Changing the 
order of summation we have to count, for a fixed A;, the number of 
permutations 7 placed on a circle such that A; € 7. This number 
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(by the same argument as in Theorem 6.4) is n-|Aj;|!(n — |A;|)!. So 


we have 
m 


1 
S- lad” |A;l!(n — |Aj|)! < nl, 
i=l : 


which yields the result. 


Problem 6C. Let A= {A),...,Am} be a collection of m distinct 
subsets of N := {1,2,...,n} such that if i # j then A; Z Aj, 
A; a Aj ca 0, A; U Aj a N. Prove that 


Problem 6D. Consider the decomposition of B,, into symmetric 
chains as described above. Show that Theorem 6.3 is an immediate 
consequence of this decomposition. Show that Theorem 6.5 reduces 
to Theorem 6.4 via this decomposition. How many of the chains 
have their smallest element in A;? 


Problem 6E. Here is an algorithm to construct a symmetric chain 
in the poset B, which contains a given element S (a subset of 
{1,2,...,n}). Consider the characteristic vector x of S; for ex- 
ample, ifn = 7 and S = {3,4,7}, then = 0011001. Mark 
all consecutive pairs 10, temporarily delete these pairs and again 
mark all consecutive pairs 10, and repeat until only a string of the 
form 00---01---11 remains. In our example, we obtain 0011001, 
where the i-th coordinates are marked for 7 = 3, 4, 5,6; when these 
are deleted, the string 001 remains. The characteristic vectors of 
the subsets in the chain are obtained by fixing all marked coordi- 
nates and letting the remaining coordinates range over the strings 
0---000, 0---001, O0---O011, ..., 1---111. In our example, these 
characteristic vectors are 


0011000, 
0011001, 
0111001, 
1111001, 
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which correspond to the subsets 
foeene toy The Hota the 412 oot Ths 


Show that this algorithm produces exactly the same symmetric 
chain containing S as is produced by the inductive algorithm of De 
Bruijn et al. described above. 


Notes. 

We shall return to partially ordered sets in Chapters 23 and 25. 

E. Sperner (1905-1980) is best known for a lemma in combina- 
torial topology known as ‘Sperner’s lemma’, which occurred in his 
thesis (1928). It was used to give a proof of Brouwer’s fixed point 
theorem. (Another connection to combinatorics: his first professor- 
ship was in Kénigsberg!) He was one of the pioneers of the famous 
Oberwolfach research institute. 

For a survey of extremal set theory, we refer to Frankl (1988). 

The short proof of the Erdéds—Ko—Rado theorem is due to Katona 
(1974). Theorem 6.5 is due to Kleitman and Spencer (1973) and 
Schénheim (1971). The proof of Theorem 6.6 is due to Greene, 
Katona and Kleitman (1976). 
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Flows in networks 


By a transportation network, we will mean a finite directed graph 
D together with two distinguished vertices s and t called the source 
and the sink, respectively, and which is provided with a function 
c associating to each edge e a nonnegative real number c(e) called 
its capacity. We may further assume that there are no loops, no 
multiple edges, and that no edges enter the source s or leave the sink 
t (although there would be no harm in admitting any of these types 
of edges other than our having to be more careful in a definition or 
two). 


Figure 7.1 


In Fig. 7.1 we give an example. We could think of a network 
of pipes through which some liquid could flow in the direction of 
the arrows. The capacity would indicate the maximal possible flow 
(per time unit) in that section of pipe. 

A flow in a transportation network is a function f assigning a 
real number f(e) to each edge e such that: 

(a) 0 < f(e) < c(e) for all edges e (the flow is feasible); 

(b) for each vertex x (not the source or the sink) the sum of the 
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values of f on incoming edges equals the sum of the values of f on 
outgoing edges (conservation of flow). 


The sum of the values of a flow f on the edges leaving the source 
is called the strength of the flow (denoted by |f|). It seems obvious 
that the strength of the flow is also equal to the sum of the values 
of f on edges entering the sink; the reader might try to verify this 
formally before reading further. 


One of our objectives will be to find a method for constructing 
a maximum flow, that is, a flow with maximum strength. Before 
we begin, it will be good to have a goal or an upper bound for the 
strength of a flow; for example, the sum of the capacities of all edges 
leaving the source is clearly such an upper bound. More generally, 
by a cut separating s and t (or simply a cut), we mean here a pair 
(X,Y) of subsets of the vertex set V := V(D) which partition V 
and such that s € X and t € Y. We define the capacity c( X,Y) of 
the cut to be the sum of the capacities of the edges directed from 
X to Y (that is, edges e = (x,y) with € X andy € Y). We claim 
that the capacity of any cut is an upper bound for the strength of 
any flow. More strongly, we claim that the conservation law implies 
(see below) that the strength of a flow f can be computed as 


(7.1) fl = F(X) -— £X), 


where f(A,B) denotes the sum of the values of f on all edges 
directed from A to B; then the feasibility of f immediately implies 
that |f| < c(X,Y). Thus the minimum capacity of all cuts in a 
network (e.g. in Fig. 7.1 the minimum cut capacity is 20) is an 
upper bound for the strength of a flow in that network. 


To establish (7.1), we introduce the function ¢ by defining for 
each pair (z,e), where x is a vertex incident with the edge e, 
o(xz,e) := —1 if the edge is incoming, and ¢(z,e) := +1 if the 
edge is outgoing; ¢(x,e) is to be 0 if x is not incident with e. (We 
remark that @¢ is essentially the incidence matrix of the directed 
graph—see Chapter 36.) The conservation law is equivalent to 
Yecr O(2, e) f(e) = 0 for x # s,t. Notice that >>, o(z,e) is +1 
if e is directed from X to Y, —1 if e is directed from Y to X, and 
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0 if e has both endpoints in X or both in Y. Then 


Ifl= > 6, fe) = >>> ol, e) fe) 
eck we X eCH 
=S/ fle) >) o(a,e) = (X,Y) — f(Y, X). 


eck cExX 


(In the first double sum above, the inner sum is 0 for all terms x 
other than s.) 

A special instance of (7.1) is |f| = f(V\{t}, {t}), the assertion 
which we invited the reader to reflect on earlier. 

We now construct flows. Fix a flow f, possibly the 0-flow. We 
shall say that the sequence 29, %1,...,Up—1, Xp Of distinct vertices is 
a special path from xp to xz if for each i, 1 <7i<k, either 


(i) e = (a;_1, 2;) is an edge with c(e) — f(e) > 0, or 
(ii) e = (aj, 4;-1) is an edge with f(e) > 0. 


Edges e with f(e) = c(e) are said to be saturated and conditions (i) 
and (ii) can be stated in words as requiring that ‘forward’ edges of 
the path are unsaturated while ‘backward’ edges are positive—all 
with respect to a given flow f. Suppose there exists such a special 
path from s to t. Define a; as c(e) — f(e) in the first case and as 
f(e) in the second case (picking one of the edges to use if both cases 
hold) and let a be the minimum of these positive numbers a;. On 
each edge of type (i) increase the flow value by a, and on each edge 
of type (ii) decrease the flow by a. It is easy to check that the two 
conditions for a flow (feasibility and conservation of flow) are still 
satisfied. Clearly the new flow has strength |f| + a. 

This idea for obtaining a stronger flow becomes an algorithm 
when we iterate it (starting with the 0-flow) and incorporate a 
systematic procedure for searching for special paths from s to t 
with respect to the current flow. We make brief remarks concerning 
termination in the notes to this chapter. But what happens when 
we can go no further? 

Suppose that no special path from source to sink exists with re- 
spect to some flow fp. Let Xo be the set of vertices x which can be 
reached from s by a special path, Yo the set of remaining vertices. 
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In this way we produce a cut. If x € Xo, y € Yo and e = (2, y) is 
an edge, then e must be saturated or we could adjoin y to a special 
path from s to x to get a special path from s to y, contradicting 
the definitions of Xo and Yo. If, on the other hand, e = (y, x) is an 
edge, then, for a similar reason, f(e) must be 0. In view of (7.1), 
we have then 


| fol = fo( Xo, Yo) — fo(Yo, Xo) = c(Xo, Yo). 


Now it is clear that not only can no stronger flow be obtained by 
our method of special paths, but that no stronger flows exist at all 
because |f| < c(Xo, Yo) for any flow f. 

If fo is chosen to be a maximum flow (which exists by continuity 
reasons in case one is unsure of the termination of the algorithm), 
then surely no special paths from s to ¢t exist. Note that the con- 
structed cut (Xo, Yo) is a minimum cut (i.e. a cut of minimum ca- 
pacity), since c(X,Y) > |fo| for any cut (X,Y). Our observations 
have combined to prove the following famous theorem of Ford and 
Fulkerson (1956). 


Theorem 7.1. In a transportation network, the maximum value 
of | f| over all flows f is equal to the minimum value of c(X, Y ) over 
all cuts (X,Y). 


This theorem is usually referred to as the ‘maxflow-mincut’ the- 
orem. The procedure for increasing the strength of a flow that we 
used above shows somewhat more. 


Theorem 7.2. If all the capacities in a transportation network 
are integers, then there is a maximum strength flow f for which all 
values f(e) are integers. 


PROOF: Start with the 0-flow. The argument above provides a way 
to increase the strength until a maximum flow is reached. At each 
step @ is an integer, so the next flow is integer valued too. 


Problem 7A. Construct a maximum flow for the transportation 
network of Fig. 7.1. 


Problem 7B. An elementary flow in a transportation network is 
a flow f which is obtained by assigning a constant positive value 
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a to the set of edges traversed by a simple (directed) path from s 
to t, and 0 to all other edges. Show that every flow is the sum of 
elementary flows and perhaps a flow of strength zero. (This means 
we can arrive at a maxflow by starting from the 0-flow and using 
only special paths with ‘forward’ edges.) Give an example of a 
network and a flow which is not maximum, but with respect to 
which there are no special paths using only ‘forward’ edges. 


Problem 7C. Let (Xj, Y;) and (X92, Yj) be minimum cuts (i.e. cuts 
of minimum capacity) in a transportation network. Show that (X,U 
X2, Yi Y2) is also a minimum cut. (This can be done either from 
first principles, or with an argument involving maximum flows.) 


Problem 7D. Prove P. Hall’s marriage theorem, Theorem 5.1, 
from Theorems 7.1 and 7.2. 


It should be clear that the topic of this chapter is of great practi- 
cal importance. Routing schemes for all kinds of products depend 
on algorithms that produce optimal flows through transportation 
networks. We do not go into the algorithmic aspect of this area. 
Instead, we shall show a beautiful application of Theorem 7.2 to a 
problem related to Birkhoff’s theorem, Theorem 5.5. Before giving 
the theorem and its proof, we observe that several attempts were 
made to prove it by reducing it to Theorem 5.5 but with no success. 
The proof below is due to A. Schrijver. (If 6 = v in Theorem 7.3, 
then we have the situation of Theorem 5.5.) 


Theorem 7.3. Let A beab x v (0, 1)-matrix with k ones per row 
and r ones per column (so bk = vr). Let a be a rational number, 
0 <a <1, such that k' = ak and r' = ar are integers. Then there 
is a (0, 1)-matrix A’ of size b x v with k' ones per row and r’ ones 
per column such that entries a}; of A’ are 1 only if the corresponding 
entries of A are 1, i.e. A’ can be obtained from A by changing some 


ones into zeros. 


Proor: We construct a transportation network with vertices s (the 
source), 11,..., 2, (corresponding to the rows of A), y1,..., Yy (cor- 
responding to the columns of A), and t (the sink). Edges are (s, x;) 
with capacity k, 1 <i < b, (a;,y;) with capacity 1 if and only if 
ajj = 1, and (y;,t) with capacity r, 1 < 7 < v. The definition 
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ensures that there is a maximum flow with all edges saturated. 
We now change the capacities of the edges from the source to k’ 
and those of the edges to the sink to r’. Again, all the capacities 
are integers and clearly a maximum flow exists for which the flows 
f((ai, y;)) are equal to a. By Theorem 7.2 there is also a maximum 
flow f* for which all the flows are integers, ie. f*((xi,y;)) = 0 or 
1. From this flow, we immediately find the required matrix A’. 


The theorem above can be generalized in several ways with es- 
sentially the same proof idea, but see below for a slightly different 
approach. 

For some combinatorial applications, it is convenient to use the 
following theorem, which does not require the introduction of ca- 
pacities or the concept of strength. It can be derived from Theorem 
7.2—see Ford and Fulkerson (1956)—but we choose to give a direct 
proof. 

A circulation on a digraph D is a mapping f from E(D) to the 
reals satisfying conservation of flow at every vertex. We do not 
require nonnegativity. Circulations may be identified with vectors 
in the null space of the incidence matrix of the digraph. 


Theorem 7.4. Let f be a circulation on a finite digraph D. Then 
there exists an integral circulation g such that for every edge e, g(e) 
is equal to one of | f(e)| or | f(e)]. 


We may say that the values of g are those of f ‘rounded up or 
down’. Of course, if f(e) is already an integer, then g(e) = f(e). 


PROOF: Given acirculation f, consider a circulation g that satisfies 


(7.2) Lf(e)] < g(e) < fle) 


and for which the number of edges e with g(e) an integer is as large 
as possible subject to (7.2). 

Let H be the spanning subgraph of D with edge set consisting of 
those edges of D for which g(e) is not an integer, i.e. for which strict 
inequality holds both times in (7.2). Conservation of flow implies 
that no vertex can have degree 1 in H, so if g is not integral, then 
H contains a polygon. 

Let P be a polygon in H and traverse P with a simple closed 
path; let A be the set of edges of P that are forward edges of the 
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path in D, and B the set of edges of P that are backward edges in 
this path. For any constant c, we obtain a new circulation g’ by 


g(e) +e ifeeA, 
g(e):=< gle)-—c ifeeB, 
g(e) ife ¢ E(P). 


If c is small, (7.2) will still hold with g replaced by g’. Now choose 


c= min { min (T/(e)] - g(e)), min (9(e) - Lf(e)J) }. 
e€A e€B 

Then g’ still satisfies (7.2), yet g/(e) is an integer for at least one 

more edge (any edge for the which term in the expression above 

achieves the minimum). This would contradict the choice of g, 

were g not integral. 


Corollary. Let f be an integral circulation on a finite digraph 
D and d any positive integer. Then f can be written as the sum 
gi t+g2+-::+ 9a of integral circulations such that for each index j 
and each edge e, 


(7.3) Lf(e)/d] < gj(e) < [fle)/dl. 


PROOF: By induction on d. For d = 1, there is nothing to prove. 
Given d > 2, apply Theorem 7.4 to f/d to find an integral circu- 
lation g; satisfying (7.3) for 7 = 1. Apply the induction hypothesis 
to find 
f-Mm=92+93+---+ 94 


where for each j = 2,3,...,d, g; is an integral circulation satisfying 


L(F(e) — ile) /(d— I] < ale) < [F(@) — mle) /(@— DI. 


An easy exercise is that if a is an integer and 6 is either |a/d| or 
[a/d], then 


ee 


so that the above inequalities imply (7.3) for 7 = 2,3,...,d. 
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From an m X n matrix A of real numbers a;;, not necessarily 
nonnegative or integers, we obtain a circulation on a digraph with 
m+n-+ 2 vertices and mn+m-+n-+1 edges. The digraph is 
similar to the one used in the proof of Theorem 7.3. There are ver- 
tices %1,...,2%2m corresponding to the rows, vertices yj,..., Yn Cor- 
responding to the columns, and two others called s and t. There 
is an edge from x; to y; with circulation value a;;, an edge from s 
to x; with circulation value equal to the i-th row-sum r;, an edge 
from y; to t with circulation value equal to the j-th column-sum k; 
(i =1,...,m, 7 =1,...,n), and an edge from t to s with circula- 
tion value equal to the sum of all entries of M. If we multiply this 
circulation f by any scalar a, apply Theorem 7.4 to af, and rein- 
terpret the resulting integral circulation as a matrix, we obtain part 
(i) of the following theorem. Part (ii) follows from the corollary. 


Theorem 7.5. (i) Given a matrix A and a real number a, there 
is an integral matrix B so that the entries of B, the row-sums of 
B, the column-sums of B, and the sum of all entries of B, are the 
corresponding values for aA rounded up or down. (ii) If A is an 
integral matrix and d any positive integer, then 


A=B,+Bot-:-+Bag 


where each B; is an integral matrix whose entries, row-sums, column- 
sums, and sum of all entries, are those of (1/d)A, rounded up or 
down. 


Problem 7E. Show that the following results are quick conse- 
quences of Theorem 7.5: (i) Problem 5A(iii); (ii) Theorem 5.5; (iii) 
Theorem 7.3; (iv) A finite graph all of whose vertices have even de- 
gree has a balanced orientation, where the in-degree and out-degree 
of each vertex are equal; (v) If a bipartite graph has minimum de- 
gree d and maximum degree d, then its edges may be colored with 
d colors so that the colors that appear at every vertex are distinct, 
and with d colors so that all colors appear at each vertex. 


Problem 7F. Show that the dimension of the vector space of all 
circulations on a connected digraph D is |E(D)| — |V(D)| +1. 
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Notes. 

The term augmenting path is often used instead of special path. 

If the capacities of a transportation network are integers, the spe- 
cial path method for constructing maximum flows will terminate af- 
ter finitely many iterations, since the strength increases by at least 
one each time. But Ford and Fulkerson (1962) give an example with 
irrational capacities where certain contrived choices of special paths 
lead to an infinite sequence of flows whose strengths converge—but 
only to one-fourth of the actual maximum flow strength! If one 
is careful to pick shortest special paths, however, then it can be 
shown that a maximum flow is reached after at most O(n?) itera- 
tions, where n is the number of vertices. See Edmonds and Karp 
(1972). 

The problem of finding a maximum flow is an example of a linear 
programming problem and can be solved e.g. by the simplex algo- 
rithm. The network flow problem is special in that its matrix is 
totally unimodular, and this is one way of explaining why Theorem 
7.2 holds. See the references below for more discussion of linear and 
integer programming. Graphical methods are usually faster than 
the simplex algorithm, and add insight. 

Circulations on a digraph are called 1-cycles in algebraic topol- 
ogy. An analogue of Theorem 7.4 holds for vectors f in the null 
space of any totally unimodular matrix. 

Theorems 7.1, 7.2, 7.4, and the algorithm have many further 
combinatorial applications, since certain combinatorial problems 
can be phrased in terms of transportation networks. For example, 
finding a maximum matching in a bipartite graph is equivalent to 
finding a maximum (integer valued) flow in a certain associated 
network—see the references—and thus a good algorithm exists to 
find a maximum matching. We give further applications of these 
theorems in Chapter 16 to an existence problem on (0,1)-matrices, 
and in Chapter 38 to a problem on partitions of sets. 
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De Bruijn sequences 


The following problem has a practical origin: the so-called rotating 
drum problem. Consider a rotating drum as in Fig. 8.1. 


Figure 8.1 


Each of the segments is of one of two types, denoted by 0 and 1. 
We require that any four consecutive segments uniquely determine 
the position of the drum. This means that the 16 possible quadru- 
ples of consecutive 0’s and 1’s on the drum should be the binary 
representations of the integers 0 to 15. Can this be done and, if yes, 
in how many different ways? The first question is easy to answer. 
Both questions were treated by N. G. de Bruijn (1946) and for this 
reason the graphs described below and the corresponding circular 
sequences of 0’s and 1’s are often called De Bruijn graphs and De 
Bruijn sequences, respectively. 

We consider a digraph (later to be called G,) by taking all 3- 
tuples of 0’s and 1’s (i.e. 3-bit binary words) as vertices and joining 
the vertex 212%2x%3 by a directed edge (arc) to x9730 and 22731. 
The arc (#1 %2%3,£%2%3%4) is numbered e;, where x, 279x324 is the 
binary representation of the integer 7. The graph has a loop at 000 
and at 111. As we saw before, the graph has an Eulerian circuit 
because every vertex has in-degree 2 and out-degree 2. Such a 
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closed path produces the required 16-bit sequence for the drum. 
Such a (circular) sequence is called a De Bruijn sequence. For 
example the path 000 — 000 — 001 — 011 — 111 — 111 — 110 — 
100 — 001 — 010 — 101 — 011 — 110 — 101 — 010 — 100 — 000 
corresponds to 0000111100101101 (to be read circularly). We call 
such a path a complete cycle. 

We define the graph G,, to be the directed graph on (n—1)-tuples 
of 0’s and 1’s in a similar way as above. (So G’, has 2” edges.) 

The graph Gy, is given in Fig. 8.2. In this chapter, we shall call a 
digraph with in-degree 2 and out-degree 2 for every vertex, a ‘2-in 
2-out graph’. For such a graph G we define the ‘doubled’ graph G* 
as follows: 

(i) to each edge of G there corresponds a vertex of G*; 

(ii) if a@ and b are vertices of G*, then there is an edge from a to 
b if and only if the edge of G corresponding to a has as terminal 
end (head) the initial end (tail) of the edge of G corresponding to 
b. 

Clearly G* = Gn41. 


0 
1 Mat 
Oe 


Hit 
Figure 8.2 


Theorem 8.1. Let G be a 2-in 2-out graph on m vertices with M 
complete cycles. Then G* has 2™ !M complete cycles. 


PROOF: The proof is by induction on m. 
(a) If m = 1 then G has one vertex p and two loops from p to p. 
Then G* = G2 which has one complete cycle. 


&. De Bruijn sequences 73 


(b) We may assume that G is connected. If G has m vertices 
and there is a loop at every vertex, then, besides these loops, G is 
a circuit pj > py +--+ > Dm — py. Let A; be the loop p; — p; and 
B; the arc p; — pjisi. We shall always denote the corresponding 
vertices in G* by lower case letters. The situation in G* is as in 
Fig. 8.3. 


Figure 8.3 


Clearly a cycle in G* has two ways of going from }; to bj41. So 
G* has 2’"-' complete cycles, whereas G has only one. 

(c) We now assume that G has a vertex x that does not have 
a loop on it. The situation is as in Fig. 8.4, where P,Q, R,S are 
different edges of G (although some of the vertices a,b,c,d may 
coincide). 

From G we form a new 2-in 2-out graph with one vertex less by 
deleting the vertex x. This can be done in two ways: G is obtained 
by the identification P = R, Q = S, and G» is obtained by P = S, 
Q = R. By the induction hypothesis, the theorem applies to G 
and to Go. 


e b Pp q 
G c . o 
R S 
c d r s 
Figure 8.4 


There are three different types of complete cycle in G*, depending 
on whether the two paths leaving r and returning to p, respectively 
q, both go to p, both to q, or one to p and one to q. We treat one 
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case; the other two are similar and left to the reader. In Fig. 8.5 
we show the situation where path 1 goes from r to p, path 2 from 
s to q, path 3 from s to p, and path 4 from r to q. 


oo 
Pa 


£» 


Figure 8.5 


These yield the following four complete cycles in G*: 


1, pr, 4, qs, 3, ps, 2, ar 

1, ps, 2, qr, 4, qs, 3, pr 

1, ps, 3, pr, 4, qs, 2, ar 

1, ps, 2, qs, 3, pr, 4, @r 
In Gj and G5 the situation reduces to Fig. 8.6. 

4 2 

Gt G3 


Figure 8.6 


In each of G} and G3 one complete cycle using the paths 1, 2,3, 4 
is possible. In the remaining two cases, we also find two complete 
cycles in Gj and G5 corresponding to four complete cycles in G*. 
Therefore the number of complete cycles in G* is twice the sum of 
the numbers for G} and G5. On the other hand, the number of 
complete cycles in G is clearly equal to the sum of the correspond- 
ing numbers for G; and Gy. The theorem then follows from the 
induction hypothesis. 


We are now able to answer the question how many complete 
cycles there are in a De Bruijn graph. 
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Theorem 8.2. G,, has exactly 22". '~" complete cycles. 


PROOF: The theorem is true for n = 1. Since GF = Gri, the 
result follows by induction from Theorem 8.1. 


For a second proof, see Chapter 36. 


Problem 8A. Let a be a primitive element in Fyn. For 1 <i < 
m := 2” —1, let 


n-1 


=) 
| 


Cyr. 


&, 
ll 
° 


Show that the sequence 


0, C10, C205 +++ Cmo 
is a De Bruijn sequence. 


Problem 8B. Find a circular ternary sequence (with symbols 
0,1,2) of length 27 so that each possible ternary ordered triple 
occurs as three (circularly) consecutive positions of the sequence. 
First sketch a certain directed graph on 9 vertices so that Eulerian 
circuits in the graph correspond to such sequences. 


Problem 8C. We wish to construct a circular sequence ag,...,@7 
(indices mod 8) in such a way that a sliding window aj, a;+1, ai+3 
(i = 0,1,...7) will contain every possible three-tuple once. Show 
(not just by trial and error) that this is impossible. 


Problem 8D. Let m := 2” — 1. An algorithm to construct a De 
Bruijn sequence ag, @1,...,@m works as follows. Start with aj = 
a, = ++: = @,_, = 0. For k > n, we define a; to be the maximal 
value in {0,1} such that the sequence (ag—n41,---,@%-1, 4%) has 
not occurred in (ao,...,@%—1) aS a (consecutive) subsequence. The 
resulting sequence is known as a Ford sequence. Prove that this 
algorithm indeed produces a De Bruijn sequence. 


Notes. 

Although the graphs of this chapter are commonly called De 
Bruijn graphs, Theorem 8.1 was proved in 1894 by C. Flye Sainte- 
Marie. This went unnoticed for a long time. We refer to De Bruijn 
(1975). 
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N. G. de Bruijn (1918—), one of the best-known Dutch mathe- 
maticians, worked in many different areas such as analysis, number 
theory, combinatorics, and also computing science and crystalogra- 
phy. 

We mention a peculiarity concerning the spelling of some Dutch 
names. When omitting the initials of N. G. de Bruijn, one should 
capitalize the word ‘de’ and furthermore the name should be listed 
under B. Similarly Van der Waerden is correct when the initials are 
omitted and he should be listed under W. 

For a proof of Theorem 8.1 using algebraic methods, we refer to 
Chapter 36. 
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9 
Two (0,1,*) problems: 


addressing for graphs and 
a hash-coding scheme 


The following problem originated in communication theory. For 
a telephone network, a connection between terminals A and B is 
established before messages flow in either direction. For a network 
of computers it is desirable to be able to send a message from A 
to B without B knowing that a message is on its way. The idea is 
to let the message be preceded by some ‘address’ of B such that 
at each node of the network a decision can be made concerning the 
direction in which the message should proceed. 


A natural thing to try is to give each vertex of a graph G a 
binary address, say in {0,1}*, in such a way that the distance of two 
vertices in the graph is equal to the so-called Hamming distance of 
the addresses, i.e. the number of places where the addresses differ. 
This is equivalent to regarding G as an induced subgraph of the 
hypercube Hy, which has V(H,) := {0,1}* and where k-tuples are 
adjacent when they differ in exactly one coordinate. The example 
G = kz; already shows that this is impossible. We now introduce a 
new alphabet {0,1,*} and form addresses by taking n-tuples from 
this alphabet. The distance between two addresses is defined to be 
the number of places where one has a 0 and the other a 1 (so stars 
do not contribute to the distance). For an addressing of a graph 
G, we require that the distance of any two vertices in G is equal to 
the distance of their addresses. It is trivial to show that this can be 
done if n is large enough. We denote by N(G) the minimum value 
of n for which there exists an addressing of G with length n. 
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For a tree we can do without the stars as follows. We use in- 
duction. For a tree with two vertices, we have a trivial addressing 
with length 1. Suppose that we can address trees with k vertices. 
If 20, %1,...,@, are the vertices of the tree T and xp is a monova- 
lent vertex, then consider an addressing for the tree obtained by 
removing x9. Let x; be the address of x; and suppose 29 is joined 
to x,. We change all addresses to (0,x;), 1 <i < k, and give xo 
the address (1, x1). Clearly this is now an addressing for T’. So for 
a tree, we have N(T) < |V(T)| -1. 

As a second example, consider K,,. In the identity matrix of 
size m — 1, we replace the zeros above the diagonal by stars and 
add a row of zeros. Any two rows now have distance 1 and hence 
N(K pm) <m-—1. 

As a third example, we consider the graph of Fig. 9.1. 


3 


4 
Figure 9.1 


A possible (though not optimal) addressing is 


oF WN FR 
oo * Fe 
OO 31> 
oro x* & 
SO One 
ox FX *¥ 


We now show a correspondence between addressings of a graph 
and quadratic forms (an idea of Graham and Pollak, 1971). Con- 
sider the graph G of Fig. 9.1 and the addressing given above. 
To the first column of the addressing, we associate the product 
(v1 +2%2)(%4+25). Here 2; is in the first, respectively second, factor 
if the address of 7 has a 1, respectively a 0, in the first column. 
If we do the same thing for each column and then add the terms, 
we obtain a quadratic form )°¢ d,;7;2;, where d;; is the distance of 
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the vertices 2 and 7 in G. Thus an addressing of G corresponds to 
writing the quadratic form )¢ d;;x;2; as a sum of n products 


(te Paes + 5,) (i, are + 24) 


such that no x; occurs in both of the factors. The number of vari- 
ables is |V(G)|. 


Theorem 9.1. Let n,, respectively n_, be the number of positive, 
respectively negative, eigenvalues of the distance matrix (d;;) of the 
graph G. Then N(G) > max{n;,n_}. 


PRooF: Each of the quadratic forms mentioned above can be rep- 
resented as 5x! Ax, where x := (21,%9,...,%n) and A has entry 
aj; = 1 if the term x;2,; occurs in the quadratic form and 0 other- 
wise. Such a matrix has rank 2 and trace 0. Therefore it has one 
positive and one negative eigenvalue. Since (d;;) is the sum of the 
matrices corresponding to the quadratic forms, it can have at most 


n positive (respectively negative) eigenvalues. 


Theorem 9.2. N(K,,) =m-—1. 


PROOF: We have already seen that N(Ki,) <m-—1. Since J — J, 
of size m, is the distance matrix of K,,, and the eigenvalues of J —J 
are m—1, with multiplicity 1, and —1, with multiplicity m—1, the 
result follows from Theorem 9.1. 


With slightly more work, we shall now show that the shortest 
addressing for a tree T has length |V(T)| — 1. 


Theorem 9.3. If T is a tree on n vertices, then N(T) =n —1. 


PROOF: We first calculate the determinant of the distance matrix 
(d;;) of T. We number the vertices pi, ..., pp in such a way that pp, is 
an endpoint adjacent to p,_1. In the distance matrix, we subtract 
row n—1 from row n, and similarly for the columns. Then all 
the entries in the new last row and column are 1 except for the 
diagonal element which is equal to —2. Now renumber the vertices 
Pi,-++;Pn—1 in such a way that the new vertex p,_; is an endpoint 
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of T\{p,} adjacent to pp_2. Repeat the procedure for the rows and 
columns with numbers n— 1 and n— 2. After n — 1 steps, we have 
the determinant 


0 1 1 1 
LZ. 0 0 
1 0. =2 0 
1 O 0 ay 


From this we find the remarkable result that the determinant D,, 
of the distance matrix of a tree on n vertices satisfies 


Dy = (-1)""1(n = Lo 2, 


i.e. it depends only on |V(T)|. If we number the vertices accord- 
ing to the procedure described above, then the k x k principal 
minor in the upper left-hand corner of the distance matrix is the 
distance matrix of a subtree on k vertices. Therefore the sequence 
1, Dy, Do,..., Dn, where D; is the determinant of the k x k minor, 
is equal to 


1A a1 eee 1) no, 


If we consider the sign of 0 to be positive, then this sequence has 
only one occurrence of two consecutive terms of the same sign. By 
an elementary theorem on quadratic forms this implies that the 
corresponding quadratic form has index 1, and hence (d;;) has one 
positive eigenvalue; see B. W. Jones (1950), Theorem 4. Now the 
result follows from Theorem 9.1. 


The conjecture that in fact N(G) < |V(G)|—1 for all (connected) 
graphs G was proved by P. Winkler in 1983. The proof is construc- 
tive. In order to describe the addressing, we need some preparation. 
Consider the graph of Fig. 9.2. 
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Figure 9.2 


We pick a vertex x9, then construct a spanning tree JT’ by a 
breadth-first search, and then number the vertices by a depth-first 
search. The result is shown on the right-hand side of Fig. 9.2, where 
edges of E(G)\E(T) are dashed. 


Let n := |V(G)| — 1. We need several definitions. 
For i <n, we define 


P(i) := {j : @; is on a path from ap to a; in T}. 
For example, P(6) = {0,3, 4,6}. Let 
iAZ = max(PC)NPO)): 


We describe the general situation in Fig. 9.3. 


Og en; 
es SS Qs 
to TiAj “Ne. ga z 

~~~ -@--7 vj 
Zz 


Note that in Fig. 9.3, we have 7 < j if and only if k < l. 
For i <n, we define 


e = tax Pa) Stab): 
For example, 7’ = 3 in Fig. 9.2. Define 


i~ j  P(i) S P(9) or PG) C Pl). 
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We denote distances in G, respectively T’, by dg, respectively dr. 
The discrepancy function c(i, 7) is now defined by 


C9): Sd da (ta) 


For example, in Fig. 9.2, c(6,9) = 4. 


Lemma 9.4. 
(i) c(t, 7) = eG, t) = 0; 
(ii) ifi ~ j, then c(i, 7) = 0; 
(iii) if ¢ 7, then c(i, 7’) < c(i,j) < cli, 7’) +2. 


PROOF: (i) is trivial; (ii) follows from the definition of T since 
dg(xi,@;) = \de(x;, 20) — de(i, to)| = dr(i, v5); 


(iii) follows from the fact that |dg(x;, x;) —dg(xi, x;’)| < 1 and that 
dy (aj, x3) = 1+ dr(aj, £57). 

Now we can define the addressing. For 0 <2 < n the vertex 2; 
is given the address a; € {0,1,*}”", where 


ay = (a;(1), a;(2), wees a;(n)) 


and 
1 if 7 € P(d), 
c(t, j) = 3 Ue) = 2, or 
a(j):= << * iff ce(t,7) —c(i,7’) =1, i< J, cli,7) even, or 


Ca, 5 — Cf) = 1 > 9-9) add, 
0 otherwise. 


Theorem 9.5. d(aj,ax) = de(xi, £x). 


PROOF: We may assume 7 < k. 

(i) Suppose i ~ k. Then dg(a;, x.) = |P(k)\P(2)|. The values of 
j such that 7 € P(k)\P(z) are exactly the positions where az(j) = 
1, aj(j) # 1. For these values of 7 we see that c(i,7) = 0, hence 
a;(7) = 0 and we are done. 

(ii) The hard case is when i % k. The key observation is the 
following. Let ny < ng <--- <n be a nondecreasing sequence of 
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integers such that |nji1 — n;| < 2 for all 7. If m is an even integer 
between n; and n; that does not occur in the sequence, then there 
is an 7 such that nj = m—1, nj. = m+ 1. Now consider the 
sequence 


eli, k) > ea, k ) > eG, hk") > <> e(, ik) = 0: 


By the definition of a;(j7) and the observation above, a;(j) = * and 
ax(j) = 1 exactly as many times as there are even integers between 
c(i,tAk) and c(i,k). Similarly ag(j) = * and aj(j) = 1 as many 
times as there are odd integers between c(i,iAk) and c(i, k). So 


d(aj, ar) = |P(k)\P(4)| + |P(t)\P(k)| — eG, k) 
= dr ijt) — Clk) = dol ay, Ze). 


Therefore we have proved the following theorem. 


Theorem 9.6. N(G) < |V(G)| — 1. 


Problem 9A. If we use the scheme defined above, what are the 
addresses of x2 and x¢ in Fig. 9.2? 


Problem 9B. Let G be a cycle (polygon) on 2n vertices. Deter- 
mine N(G). 


Problem 9C. Let G be acycle (polygon) on 2n+1 vertices. Prove 
that N(G) = 2n. Hint: if C, is the permutation matrix with 
entries c;; = 1 if and only if j —i = 1 (mod k) and ¢* = 1, then 
(1,¢,¢7,...,¢*-+) is an eigenvector of Cy. 


We now look at a second problem involving k-tuples from the 
alphabet {0,1,*}. The objects we shall study were introduced by 
Rivest (1974) and given the (unfortunate) name associative block 
design; cf. Chapter 19 for block designs. An ABD(k, w) is a set of 
b := 2” elements of {0,1,x}* with the following properties: if the 
elements are the rows of a b x k matrix C, then 
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(i) each row of C has k — w stars; 

(ii) each column of C has b(k — w)/k stars; 

(iii) any two distinct rows have distance at least 1. 

Note that the definition implies that each vector in F§ has dis- 
tance 0 to exactly one row of C. 

The origin of the problem is the following. Consider a file of k-bit 
binary words. Each sequence in {0,1,x}" is called a partial match 
query. The partial match retrieval problem is to retrieve from the 
file all words agreeing with the query in those positions where the 
query specifies a bit. So-called hash-coding schemes divide a file 
into 6 disjoint lists L1, Do,..., 2). A record x will be stored in the 
list with index h(x), where h is the ‘hash-function’ mapping {0, 1}* 
onto {1,2,...,b}. For a given partial match query, some of the lists 
must be searched. An analysis of the worst-case number of lists to 
be searched led to the concept of ABD. In this case h(x) is the 
index of the unique row of C’ which has distance 0 to x. 


Example 9.1. The following matrix is an ABD (4, 3): 


Perr + OOCOO + 
Se ee ee 
Prt OrOGO* FO 
ee 


We first prove some elementary properties of an ABD. 


Theorem 9.7. If an ABD(k,w) exists, then: 


(1) it has exactly bw/(2k) zeros and bw/(2k) ones in each col- 
umn; 

(2) for each x in F it has exactly (“) rows which agree with x 
in u positions; 

(3) the parameters satisfy 


1 
*>2k(1-=); 
w? > 2k ( 5) 
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(4) for any row, the number of rows with stars in the same 
positions is even. 


Proor: Let C be the ABD(k,w). 

(1) A row of C with a star, respectively a zero, in column j rep- 
resents (i.e. has distance 0 to) 2*-"~!, respectively 2*~”, elements 
of FS. From (i) and (ii) of the definition, it follows that column j 
must contain bw/(2k) zeros. 

(2) Let x € FS. Denote by n; the number of rows of C which 
agree with x in 7 positions. There are (7) vectors in F§ which agree 


k—-w 


with x in exactly | positions. Therefore (‘) = ni( a i: i.e. 


(1+2)'=(1+z)*"- So mz’. 


This proves that nj; = (‘). 

(3) The sum of the distances between pairs of rows of C is k(t)? 
by (1). Since any two rows have distance at least 1, this sum is at 
least C). 

(4) Consider a row of C. Count vectors in F§ which have zeros 
in the positions where the row has stars. Each row with a different 
star pattern represents an even number of such vectors whereas a 
row with the same star pattern represents exactly one such vector. 


Note that property (1) in Theorem 9.7(3) implies that a neces- 
sary condition for the existence of an ABD(k, w) is that k divides 
ee ma 

The following strengthening of Theorem 9.7(3) is due to A. E. 
Brouwer (1999). 


Theorem 9.8. Let C be an ABD(k,w) with w > 3. 


(1) If two rows of C agree in all but one position, then 


(2) otherwise w? > 2k. 
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PROOF: Suppose c, and cy are two rows of C’ which differ only 
in position one. Then all the other rows of C’ must differ from c; 
in some other position. So, by (i) of the definition and Theorem 


9.7(3), we find ; 
w 
b-2<(w—1)- 5. 
To prove the assertion, we must show that the right-hand side of 
the inequality cannot be equal to b — 2 or b— 1. In both cases, 
equality would imply 2”~'|k which contradicts Theorem 9.7 unless 
w = 4, which is excluded by substitution. 

(ii) Consider two rows of C’ which have the same star pattern. 
By hypothesis, they differ in more than one position. Again, count 
the sum of the distances of all the rows from one of this pair. This 
sum is at least 2+ (b — 2) = b and, by Theorem 9.7.1, it is equal 
to w- (bw)/(2k). So w? > 2k. We must show that equality cannot 
hold. By the argument above, equality would imply that rows with 
the same star pattern occur in pairs which have distance 2, and 
furthermore all the other rows have distance 1 to each row of such 
a pair. Without loss of generality, such a pair would be 


(x *-+-*00...000) and («x---*00...011). 


The bw/(2k) — 1 other rows ending in a 1 would have to end in 
01, for otherwise they would have distance 0 to the second row or 
distance > 1 to the first row. Similarly, there would be bw/(2k) —1 
rows ending in 10. Since we now have rows with distance 2, we 
find that necessarily bw/(2k) —1 = 1. Therefore 2” = 2w, which is 
impossible if w > 3. 
Corollary. An ABD(8,4) does not exist. 
Using these results, it is easy to find all ABD(k, w) with w < 4. 
Of course, w = 0 is trivial. For w = 1,2, or 4, we must have k = w 


(no stars). If w = 3, then either k = 3 (no stars) or k = 4. In that 
case there are two types of ABD, one given in Example 9.1. 


Problem 9D. Construct an ABD(4,3) that has the same first 
four rows as Example 9.1 but differs in the others. 


In 1987 La Poutré and Van Lint proved that an ABD(10,5) 
does not exist but the smaller possibility ABD(8,5) turned out to 
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be quite difficult to handle. In 1999 D. E. Knuth asked Brouwer 
whether any progress had been made in this area since the early 
results and this made Brouwer decide it was time the question was 
settled. An example can be found in Brouwer (1999). It does not 
seem to have any structure. Now the smallest open case is the 
question whether an ABD(12,6) exists. 

We shall now describe some construction methods. Some of the 
ideas will be used in other chapters. 


Theorem 9.9. If an ABD(k;,w;) exists for i = 1,2, then an 
ABD (kiko, wiw2) exists. 


PRooF: We can assume wy > 0. Partition the rows of ABD (kz, w2) 
into two classes Ry and R, of equal size. In ABD(k,, wi) we replace 
each star by a row of ky stars, each 0 by a row from Ro and each 1 
by a row from R in all possible ways. A trivial calculation shows 
that the resulting matrix is an ABD(kyko, ww). 


Corollary. An ABD(4‘,3') exists. 


For the proof of the next theorem, we introduce a new symbol, 
namely —. A k-tuple consisting of the symbols 0,1,*, and — rep- 
resents all possible words with only 0,1,* that can be obtained by 
replacing each — by a 0 or a 1 in all possible ways. 


Theorem 9.10. Let w > 0. Suppose an ABD(k,w) exists, where 
k = ko- 2', ko odd. Then an ABD(k,w + iko) exists for0 <i < 
(k — w)/ko. 


ProoF: It is sufficient to consider i = 1. Let C be the ABD(k, w). 
Define a matrix A of the same size by requiring a;; = 1 if Cj; = x 
and a;; = 0 otherwise. By Theorem 7.3, A is the sum of two 
matrices A; and Ay, where A; has ko ones in each row and 2”! 
ones in each column. In a row of C, replace stars by — if the star 


occurs in a position where A, has a one. This produces the required 
ABD(k, w + ko). 


Theorem 9.11. If ABD(k,w) exists and a > 1 is a number such 
that ak and aw are integers, then an ABD(ak, aw) exists. 


ProoF: It is sufficient to show that ABD(k + 1,w +m) exists for 
(k +1)/(w +m) = k/w and (l,m) = 1. Let k = ko - 2°, ko odd. 
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From (ii) of the definition we know that ko|w. Therefore wl = mk 
and (l,m) = 1 imply that / is a power of 2. Consider the 1 x 1 
circulant matrix with a row of 1 — m stars and m minus signs as 
first row. Since | divides b we can adjoin a column of b/I copies 
of this circulant to the matrix C of the ABD(k,w). It is easy to 
check that this larger matrix is an ABD(k+1,w+m). 


Example 9.2. From the corollary to Theorem 9.9 we have an 
ABD (64, 27). Theorem 9.10 then shows that an ABD(64, w) exists 
for 27 < w < 64. In particular, there is an ABD(64,32). Then 
Theorem 9.11 implies that an ABD(2w, w) exists for all w > 32. 
As mentioned before, nonexistence has been shown for w = 4 and 
w = 5 and the case w = 6 is still open. 


Notes. 

The first problem considered in this chapter was introduced by 
J. R. Pierce at Bell Laboratories as the loop switching problem. 
Several people (including one of the present authors) tried in vain 
to solve it. Shortly after R. L. Graham raised the reward for the 
solution to $200, it was solved by P. Winkler. It is worth noting that 
Winkler stated that the idea of numbering the vertices as was done 
in the proof was a regular habit due to his background in computer 
science. Going over the proof, one sees that this numbering indeed 
played a crucial role. 
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R. L. Graham and H. O. Pollak (1971), On the addressing problem 
for loop switching, Bell System Tech. J. 50, 2495-2519. 
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The principle of inclusion 
and exclusion; 
inversion formulae 


As we have seen in several of the previous chapters, many problems 
of combinatorial analysis involve the counting of certain objects. 
We now treat one of the most useful methods for counting. It 
is known as the principle of inclusion and exclusion. The idea is 
as follows. If A and B are subsets of S and we wish to count 
the elements of S\{AU B}, then the answer is not |S|— |A| — |B| 
because the elements of ANB have been subtracted twice. However 
|S|—|A|—|B|+|ANB| is correct. The following theorem generalizes 
this idea. 


Theorem 10.1. Let S be an N-set; E,,...,E, not necessarily 
distinct subsets of S. For any subset M of {1,...,r}, we define 
N(M) to be the number of elements of S in (),<,\, Ei and for 0 < 
j <7, we define Nj := Y))y)-; N(M). Then the number of elements 
of S not in any of the subsets E;, 1 <i <r, is 


(10.1) N —N, + No—N3+---+(-1)"N,. 


Proor: (i) If x € S and z is in none of the £;, then x contributes 
1 to the expression (10.1). 

(ii) If x € S and z is in exactly k of the sets E;, then the contri- 
bution to (10.1) equals 


i= (‘) +(3) ---+-1(?) =(1-1)'=0. 


90 A Course in Combinatorics 


Remark. If we truncate the sum in (10.1) after a positive (respec- 
tively, negative) term, then we have an upper (respectively, lower) 
bound for the number of elements of S' not in any of the E;. 


Because this method is of great importance, we shall give several 
examples as illustration. 


Example 10.1. Let d, denote the number of permutations 7 of 
1,2,...,m such that a(i) 4 7 for all 7 (these are called derange- 
ments). Let S := S,, and let E; be the subset of those permutations 
m with 7(i) =7. By (10.1) we find 


(10.2) dy = yen'(*) 


i=0 


From this formula, we see that for large values of n the probability 
that a permutation is a derangement is nearly e~!. From (10.2) for 
n and n — 1, we find a recursion formula for d,: 


(10.3) dn = Ndn—1 + (—1)". 


The formula (10.2) can also be obtained by inversion as follows. 
Consider the power series D(x) := S09 dn a (do = 1). Now if 
F(x) := e* D(a), then 


F@)=> 3 (”) tu] e 


m=0 \r=0 


and since dire o (rd m-r = ml, we find F(x) = (1—2z)~1. It follows 
that D(x) = e~*(1 — 2)7! and by multiplying the power series for 
the two ae we find (10.2) again. 


Example 10.2. Let X be an n-set, Y = {y1,..., yx} a k-set. We 
count the surjections of X to Y. Let S be the set of all mappings 
from X to Y, E; the subset of mappings for which y; is not in the 
image of X. By (10.1) we find the number of surjections to be 
SF (—Di(*)(k — i)". Now this number is trivially 0 if k > n and 
clearly n! if k =n. So we have proved: 


wa Bov(jeo(d tet 
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There are many formulae like (10.4) that are often quite hard to 
prove directly. The occurrence of (—1)' is usually a sign that count- 
ing the right kind of objects using the principle of inclusion and ex- 
clusion can produce the formula, as in this example. Nevertheless 
it is useful in this case to see another proof. 

Let P(x) be a polynomial of degree n, with highest coefficient a,. 
We denote the sequence of values P(0), P(1),... by P. We now 
consider the sequence of differences P(1) — P(0), P(2) — P(1),.... 
This is Qi, where Q(x) := P(x +1) — P(x), a polynomial of 
degree n — 1 with highest coefficient na,. By repeating this pro- 
cedure a number of times, we find a sequence Q; whose terms are 
yu (-Di(*) P(a +k —1i), corresponding to the polynomial Q;(x) 
of degree n — k with highest coefficient n(n — 1)...(n —k + L)ay. 
If k = n, then all the terms of Q; are nla, and if k > n, then they 
are all 0. Take P(x) = x”. We again find (10.4). 


Example 10.3. The following identity is a well known relation 
between binomial coefficients: 


(10.5) y-y'(") @ ae ‘ is { - : : 


i=0 


We see that if we wish to prove this using inclusion-exclusion, then 
the sets E; that we wish to exclude involve choosing from an n- 
set, and after choosing i of them, we must choose k — 7 elements 
from some set of size m+n-—i. This shows us that the following 
combinatorial problem will lead us to the result (10.5). Consider a 
set Z = X UY, where X = {a1,...,x2,} is an n-set of blue points 
and Y is an m-set of red points. How many k-subsets consist of red 
points only? The answer is trivially the right-hand side of (10.5). 
If we take S' to be all the k-subsets of Z and E; those k-subsets 
that contain x;, then (10.1) gives us the left-hand side of (10.5). 

Again we can ask whether this result can be proved directly. The 
answer is yes. To do this, we use the following expansion: 


(10.6) & 2 ai =(1—2)*}, 


j=0 
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Note that (—1)'(") is the coefficient of 2’ in the expansion of (1 — 
x)". From (10.6) we find that (ee) is the coefficient of «*~’ in 
the expansion of (1 — x)*-™-""!. So the left-hand side of (10.5) is 
the coefficient of x” in the expansion of (l—a)*"""!. Ifm <k-1, 
this is obviously 0 and if m > k, it is (7), again by (10.6). 


Example 10.4. (The Euler function) Let n = pi'p$?...p% be a 
positive integer. We denote by ¢(n) the number of integers k& with 
1<k<7n such that the g.c.d. (n,k) = 1. We apply Theorem 10.1 
with S := {1,2,...,n} and E; the set of integers divisible by p,, 
1<i<r. Then (10.1) yields 


(ot). HOote ye oe SS eas 


a ae i=l Pi 
The next theorem is used quite often. 


Theorem 10.2. 7, 0(d) =n. 


ProoF: Consider {1,2,...,n} = N. For each m € N, we have 
(m,n)|n. The number of integers m with (m,n) = d, i.e. m= md, 
n = nd and (m,n) = 1 clearly equals ¢(n1) = ¢(n/d). So 
N= Dan (n/a) which is equivalent to the assertion. 


At this point, it is useful to introduce the so-called Mébius func- 
tion: 


(LOS): id) = 
1 if d = product of an even number of distinct primes, 
—1 if d = product of an odd number of distinct primes, 


0 otherwise, i.e. d not squarefree. 


Theorem 10.3. 


ih, ai aed 
d) = 
2 ) { 0 otherwise. 


Proor: If n= 1, there is nothing to prove. If n = p}*... p*”, then 
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by (10.8) we have 


Note how similar the proofs of Theorems 10.1 and 10.3 are. 
Using the Mobius function, we can reformulate (10.7) as follows: 


(10.9) on) =). a 


d\n 
Problem 10A. How many positive integers less than 1000 have 
no factor between 1 and 10? 


Problem 10B. How many monic polynomials of degree n are 
there in F,[z] that do not take on the value 0 for x € F,? 


Problem 10C. Determine es y(n) |=]. 


n 


Problem 10D. One of the most famous functions in complex 
analysis is the so-called Riemann ¢-function ¢(s) := )0°,n~“*, de- 
fined in the complex plane for Re(s)> 1. Prove that 1/¢(s) = 
Se ee 


Problem 10E. Let f;,(z) be the function that has as its zeros all 
numbers 7 for which 7” = 1 but n* 4 1 for 1 <k <n. Prove that 


fn(Z) = T[j@ = 1 yer), 


k|n 
Theorem 10.3 makes it possible to derive a very useful inversion 
formula known as the Mobius inversion formula. 


Theorem 10.4. Let f(n) and g(n) be functions defined for every 
positive integer n satisfying 


(10.10) f(r) = D9), 


d\n 
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(10.11) a(n) = oud f(G) 


d\n 


PRooF: By (10.10) we have 


YH@ FG) = aS) 


d\n d\n 


=u) a) = Ya) SS alr). 


d\n d'|d d’|n m|(n/d’) 


By Theorem 10.3 the inner sum on the right-hand side is 0 unless 
ad hie 


Remark. The equation (10.11) also implies (10.10). 


Example 10.5. We shall count the number JN,, of circular se- 
quences of 0’s and 1’s, where two sequences obtained by a rotation 
are considered the same. Let M(d) be the number of circular se- 
quences of length d that are not periodic. Then N, = Dae M(d). 
We observe that >> din OM (d) = 2” since this counts all possible cir- 
cular sequences. By Theorem 10.4 we find from this equation that 
nM (n) = dan u(d)2"/4 and therefore 


(10.12) Na=S>M(d) => : > W(5)2 


d\n d\n id 


2 k 1 n 
ey a ~ = 


In k|= ijn 


The final expression has the advantage that all the terms are posi- 
tive. This raises the question whether we could have obtained that 
expression by some other counting technique. We shall see that 
the following theorem, known as Burnside’s lemma (although the 
theorem is actually due to Cauchy and Frobenius; see the notes), 
provides the answer. 
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Theorem 10.5. Let G be a permutation group acting on a set X. 
For g € G let u(g) denote the number of points of X fixed by g. 
Then the number of orbits of G is equal to a gcc V(9). 


PROOF: Count pairs (g,xz), where g € G,xe€ X, x4 = x. Starting 
with g, we find >) j<¢v(g). For each x € X there are |G|/|O,| 
such pairs, where O, is the orbit of x. So the total number equals 
IG| So .cx 1/|O,|. The orbits of G partition X, and if we sum the 
terms 1/|O,| over all x in a particular orbit, we obtain 1. Thus 
rex L/|O2| is the number of orbits. 


Example 10.5 (continued). Let G be the cyclic group of order 
n, i.e. the group of rotations of a circular sequence of 0’s and 1’s. 
If d\n there are ¢(n/d) integers g such that (n,g) = d and for each 
such g there are 2% circular sequences that are fixed by the rotation 
over g positions. So Theorem 10.5 immediately yields the result 
(10.12). 


Example 10.6. The following problem, introduced by Lucas in 
1891, is known as the ‘probleme des ménages’. We wish to seat n 
couples at a circular table so that men and women are in alternate 
places and no husband will sit on either side of his wife. In how 
many ways can this be done? We assume that the women have 
been seated at alternate places. Call the ladies 1 to n and the 
corresponding men also 1 to n. The problem amounts to placing 
the integers 1 to n on a circle with positions numbered 1 to n such 
that for all 7 the integer 7 is not in position 7 or position 7 + 1 
(mod n). Let E; be the set of seatings in which husband 1 is sitting 
next to his wife. We now wish to use inclusion-exclusion and we 
must therefore calculate in how many ways it is possible to seat r 
husbands incorrectly. Call this number A,. We do this as follows. 
Consider a circular sequence of 2n positions. Put a 1 in position 
2i — 1 if husband 27 is sitting to the right of his wife; put a 1 in 
position 22 if he is sitting to the left of his wife. Put zeros in 
the remaining positions. The configurations that we wish to count 
therefore are circular sequences of 2n zeros and ones, with exactly r 
ones, no two adjacent. Let A’. be the number of sequences starting 
with a 1 (followed by a 0). By considering 10 as one symbol, we 
see that we must choose r — 1 out of 2n —r—1 positions. To count 
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the number A” of sequences starting with a 0, we place the 0 at the 
end, and then it amounts to choosing r out of 2n —r places. Hence 


A, = Al + Al = 2n-—r—- 1 S: 2n-97r = 2n 2m 
r—l r 2n—-—97r r 


By (10.1) we find that the number of ways to seat the men is 


(10.13) S-yre—mi("-") on 


r 2Qn-Tr 
r=0 


Problem 10F. We color the integers 1 to 2n red or blue in such 
a way that if 7 is red then 7 — 1 is not blue. Prove that 


yen ") gin — On +1, 


k=0 
Can you prove this identity directly? 


Problem 10G. Count the number of permutations 71, 79,..., Xan 
of the integers 1 to 2n such that 2; + a41 4 2n+1 fori = 
VPs ne AL 


Problem 10H. Prove that forO0<k<n 


Notes. 

The principle of inclusion and exclusion occurred as early as 1854 
in a paper by Da Silva and later in a paper by Sylvester in 1883. 
For this reason (10.1) and similar formulae are sometimes called the 
formula of Da Silva, respectively Sylvester. A better name that is 
also often used is ‘sieve formula’. The formula is indeed an example 
of a principle that is used extensively in number theory, referred to 
as ‘sieve methods’. An example that is probably familiar to most 
readers is the sieve of Eratosthenes: to find the primes < n?, take 
the integers <n? and sieve out all the multiples of primes < n. 
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The derangements treated in Example 10.1 occur again in Exam- 
ple 14.1 and Example 14.10. The first occurrence of this question 
is in one of the early books on games of chance: Essai d’analyse 
sur les jeux de hazard by P. R. de Montmort (1678-1719). It is 
still often referred to by the name that he gave it: ‘probleme des 
rencontres’. Formula (10.2) is sometimes stated as follows. If n 
persons check their umbrellas (a typical Dutch example; it’s always 
raining in Holland) and subsequently pick one at random in the 
dark after a power failure, then the probability that nobody gets 
his own umbrella is roughly e~! (if n is large). 

The second proof in Example 10.2 is an example of the use of 
‘calculus of finite differences’, used extensively in numerical analy- 
sis. 

A. F. Mobius (1790-1868) was an astronomer (and before that an 
assistant to Gauss) who made important contributions to geometry 
and topology (e.g. the Mobius band). 

G. F. B. Riemann (1826-1866) was professor in Gottingen, where 
he also obtained his doctorate under Gauss. He is famous for many 
of his ideas, which include the Riemann integral, Riemann surfaces 
and manifolds, and of course the so-called Riemann hypothesis on 
the location of the zeros of the ¢-function. One wonders what he 
would have left us if he had not died so young. 

In most books in which it occurs, Theorem 10.5 is called Burn- 
side’s lemma. This is just one of many examples of theorems, etc. 
attributed to the wrong person. For a history of this misnomer, we 
refer to Neumann (1979). 

F. E. A. Lucas (1842-1891) was a French number theorist. He 
is known for his books on number theory and mathematical recre- 
ations. The former book contained the problem of Example 10.6. 
The Fibonacci numbers were given this name by Lucas. See the 
notes to Chapter 14. 
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P. M. Neumann (1979), A lemma that is not Burnside’s, Math. 
Scientist, 4, 133-141. 
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Permanents 


Before introducing the main topic of this chapter, we present a 
generalization of Theorem 10.1. As in Theorem 10.1, let S be an 
n-set, E,,...,E, (not necessarily distinct) subsets of S. Let F be 
any field. To each element a € S, we assign a weight w(a) in F. 
For any subset M of {1,2,...,r}, we define W(M) to be the sum 
of the weights of the elements of S in Mien E;. For0<7<r, we 
define W; := diij-; WM) (so Wo = dacs w(a))- 


Theorem 11.1. If E(m) denotes the sum of the weights of the 
elements of S that are contained in exactly m of the subsets E;, 
1<i<r, then 


(11.1) EGS Ee ("*) Wek 


PROOF: The proof is nearly the same as for Theorem 10.1. If 
x € S and z is contained in exactly m of the subsets F;, then the 
contribution of x to the sum in (11.1) is w(x). If « € S and = is 
contained in exactly m-+k of the subsets E;, then the contribution 
to the sum equals 
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We now give the definition of a permanent. Let A = (aj,...,an) 
be an n X n matrix with columns a; = (a4;,... cine ae Then per 
A, the permanent of A, is defined by 
(11.2) per A:= S- Qin(1 * Anr(n)- 

TES, 


So the permanent is defined in the same way as the determinant 
but without the signs depending on whether the permutation 7 is 
even or odd. 

From the definition, the following properties of the permanent 
are obvious. 


(1123) per A=per A’; 


(11.4) 
if P and Q are permutation matrices, then per A = per PAQ; 


(11.5) per A is a linear function of aj, 1<j <n. 


Of course per A is also a linear function of each of the rows of 
A. The permanent of A is much more difficult to calculate than 
its determinant. However, it is clear from (11.2) that expansion by 
rows or columns is possible. So define A;; to be the matrix obtained 
from A by deleting row i and column j. Then 


per A= { Sj Ggper Ay, 19 <n, 


11.6 
Cr) dja aijper Ay, L<i<n. 


The following method of calculating a permanent (due to Ryser) is 
an application of Theorem 11.1. 


Theorem 11.2. Let A be ann x n matrix. If A, is obtained from 
A by deleting r columns, then S(A,) denotes the product of the 
row-sums of A,. We define 1, to be the sum of the values of S(A,) 
for all possible choices of A,. Then 


(11.7) per A= ye 
r=0 
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PROOF: Let S be the set of all products p = aj;, ...@ni;,, and define 
w(p) := p. Define E; to be the set of products p for which j ¢ 
{i1,---,%}. Then the permanent of A is the sum of the weights of 
the elements of S that are not in any of the subsets E;. So (11.7) 
is an immediate consequence of (11.1). 


Problem 11A. Prove (10.4) using Theorem 11.2. 


Remark. If Aj,...,An are subsets of {1,...,n} and aj; = 1 if 
j € Aj, 0 otherwise, then per A counts the number of SDRs of the 
sets Ay,..., An. 


Example 11.1. We find another formula for the number of de- 
rangements of 1,2,...,n. The permanent of the matrix J — I of 
size n is clearly d,. From (11.7) we find 


(11.8) d, = > (-1)" (") (n—r)"(n—r —1)". 


r=0 


n—-Lr 


By expanding the term (n — 1 — r) and applying (10.4) after 
changing the order of summation, we find a complicated proof of 
(10.2). 


During the 1970’s, several well known conjectures on permanents 
of (0,1)-matrices were proved, often by ingenious arguments. In 
fact, much of the research on permanents was motivated by these 
conjectures. Therefore we will devote attention to a number of 
these results in this and the next chapter. As an introduction, we 
consider (0,1)-matrices with two ones in each row and column. 


Theorem 11.3. If A is a (0, 1)-matrix in which all row-sums and 
column-sums are 2, then 


per A< glen) 


PROOF: Consider the graph G whose vertices correspond to the 
rows of A, whose edges correspond to the columns of A, and where 
vertex 7 and edge j are incident exactly when A(i,j) = 1. This 
graph is regular of degree 2, and hence is the disjoint union of 
polygons. The submatrix corresponding to the vertices and edges 
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of a polygon is, after reordering rows and columns if necessary, a 
circulant 


1100--00 
0110--00 
OO011--00 
0000-11 
1000--01 


(This may degenerate into a 2 by 2 matrix of 1’s.) The matrix A 
is the direct sum of such matrices, each of which has permanent 
2. The number of factors is at most |$n]| and we see that equality 
holds in the theorem if A is the direct sum of |}n| matrices J of 
size 2. 


This elementary theorem is concerned with the relationship be- 
tween the row-sums of a matrix and the permanent of that matrix, 
and the same is true for many of the following theorems. This 
brings us to the first difficult question. It was conjectured by H. 


Minc in 1967 that if A is a (0,1)-matrix with row-sums r,...,1n, 
then 
(11.9) per A< [[eo™- 

j=l 


Observe that the proof of Theorem 11.3 shows that equality can 
hold in (11.9) and in fact, we have equality if A is the direct sum of 
matrices J;,, Several results that were weaker than (11.9) were proved, 
often by intricate and long arguments. The conjecture was finally 
proved in 1973 by L. M. Brégman. All the more surprising is the fact 
that A. Schrijver came up with an extremely elegant and very short 
proof of Minc’s conjecture in 1977. The proof depends on the 
following lemma. 


Lemma 11.4. If t,,to,...,t, are nonnegative real numbers, then 
eee tite +t, 
(A+—**) eye. 
‘i 
PROOF: Since x log x is a convex function, we have 


ee Sm fey ty log t; +...t, log t, 
——__ log < 
(a is 


3) 
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which proves the assertion. 


In the following, we use (11.6) in the following form: 


per A= Ss per Aj,. 


k,ajp=l 


Theorem 11.5. Let A be ann x n (0,1)-matrix with r; ones in 
rowi,1<i<n. Then 


perrA< [[@ar”™. 
i=1 


PRooF: The proof is by induction on n. For n = 1, the theorem is 
trivial. We assume that the theorem is true for matrices of size n—1. 
The idea is to estimate (per A)”"?* 4 and to split this expression 
into several products. Now note that r; is the number of values of 
k; for which a;, = 1 and apply the lemma. We find: 


n 


(11.10) (per Aj"? 4 = [[@e Avera 


i=1 


n 
per A per Aix 
<TT(e* TI per at 
i=l 


k,ajn=l 


Now, let S denote the set of all permutations v of {1,...,n} for 
which a,,, = 1 fori = 1,...,n. So |S| = per A. Furthermore, 
the number of v € S such that v; = k is per Aj, if aj, = 1 and 0 
otherwise. So the right-hand side of (11.10) is equal to 


wo ftir) (ie) 


We now apply the induction hypothesis to each Aj,,. This yields 


(11:12) (per AjpPe4 < 


ff twits 


ves i=1 i=l jAt, JAt, 
ajp,=0 ajy,=1 
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Since the number of 7 such that i 4 j and aj,, = 0 is n—r;, and the 
number of 7 such that 7 £ j and aj,, = 1 is r; — 1, we can replace 
the right-hand side of (11.12) by 


ni) fer 


n nper A 
jets = r;! L/r; : 
nent [Te 


vEeS i=1 i=l 


and the assertion is proved. 


We now shall consider a special class of (0,1)-matrices, namely 
the (0,1)-matrices that have exactly k ones in each row and column. 
We denote this class by A(n,k). We define: 


(11.13) M(n,k) := max{per A: A € A(n,k)}, 


(11.14) m(n,k) := min{per A: A € A(n,k)}. 
By taking direct sums, we find the following inequalities: 


(11.15) M(n + N92, k) > M(n, k)M (no, k), 


(11.16) m(n, + n9,k) < m(n,k)m(ng, k). 
These two inequalities allow us to introduce two more functions 
using the following result, known as Fekete’s lemma. 


Lemma 11.6. Let f : N— N bea function for which f(m+n) > 
f(m)f(n) for all m,n € N. Then limp... f(n)/" exists (possibly 
oo). 


PROOF: Fix m and fix J, 1 < m. By induction, we find from the 
inequality for f that f(l + km) > f(l)[f(m)]*. Therefore 


lim inf rae + km) (+km) > f(m)y™ 
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and since there are m possible values for /, we in fact have 
liminf f(n)/" > f(m)/™. 
Now let m — oo. We find that 


lim inf f(n)!/" > limsup f(m)!/", 


but then these are equal. 


The assertion of the lemma is also true if in the inequality for f 
we replace > by <. By applying the lemma to (11.15) and (11.16), 
we can define: 


(11.17) M(k) := lim {M(n, ky}, 
(11.18) m(k) := lim {m(n, k)}/”, 


MOO 


Problem 11B. Prove that M(n,k) > k!. Prove that M(k) < 
(k!)\/*. Show by example that M(k) > (k!)!/". This shows that 
M(k) = (kI)M*. 


The function m(n,k) is much more difficult to handle. What we 
should expect is based on a famous problem still referred to as the 
Van der Waerden conjecture, although in 1981 two different proofs 
of the conjecture appeared (after nearly 50 years of research on this 
question!). We formulate the conjecture below, and the proof will 
be given in the next chapter. 


Conjecture. If A is ann x n matrix with nonnegative entries in 
which all row-sums and column-sums are 1, then 


(11.19) per A>nin™. 


The matrices considered in the conjecture are usually called dou- 
bly stochastic matrices. If A € A(n,k), then dividing all the ele- 
ments of A by k yields a doubly stochastic matrix. Therefore the 
conjecture (now a theorem) shows that m(k) > k/e. This is re- 
markable because the value of M(k) given in Problem 11B tends 
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to k/e for k — oo (see the notes). This means that for large n, 
and A an arbitrary element of A(n,k), the value of (per A)!/” is 
nearly k/e. For a long time, the best lower bound for m(n, 3) was 
n+ 3 and even that was not easy to prove. Once again, the next 
improvement was both considerable and elementary. We now give 
the proof of that result, due to Voorhoeve (1979). 


Theorem 11.7. m(n,3) > 6+ (3)""%. 


PROOF: Let U,, denote the set of n x n matrices with nonnegative 
integers as entries and all row-sums and column-sums 3; u(n) := 
min{per A: A € U,,}. Denote by V,, the set of all matrices obtained 
from elements of U,, by decreasing one positive entry by 1; u(n) := 
min{per A: A € V,,}. We first show that 


(11.20) ies Se) . 


Let A be an element of U,, with first row a = (a4, @2,43,0,...,0), 
where a; > 0 for 1 = 1, 2,3. Since 


2a = a;(a1 — 1, a2, 03,0,...,0) + a2(a1, a2 — 1,03,0,...,0) 
+ a3(Qy, QA2, A3 — Ie, 0, ara 0), 
we find from (11.5) that 2u(n) > (a1 + a2 + ag)u(n) = 3v(n), 


proving the assertion. 
Next, we show that 


(11.21) v(n) > Fug = | 


We must distinguish between two cases. In the first one, A is an 
element of V,, with first row (1, 1,0,...,0) and the matrix obtained 
from A by deleting the first row has the form (c;,c2,B). The 
column-sum of c3 := Cc; + Cp is either 3 or 4. By (11.6), we have 


per A = per (ci, B) + per (c2, B) = per (cs, B). 


If the column-sum of c3 is 3, then the matrix (c3, B) is in U,_,; and 
we are done by (11.20). If the sum is 4, then we use the same trick as 
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above: write 3c3 as a linear combination of four vectors d; such that 
each matrix (d;, B) is in V,_; and we find that 3per A > 4v(n—1). 
The second case that we have to consider is that A has (2,0,...,0) 
as first row. If we delete the first row and column of A, then there 
are again two possibilities. We obtain a matrix B that is either in 
U,-1 or in V;-1. So we have per A > 2min{u(n —1), v(n —1)} and 
we are done by (11.20). By combining (11.20) and (11.21) with the 
trivial value v(1) = 2, the assertion of the theorem follows. 


We now consider a larger class of n x n matrices, namely those 
with nonnegative integers as entries and all row-sums and column- 
sums equal to k. We denote this class by A(n,k) and the minimal 
permanent within the class by A(n, k). Again we have A(m+n, k) < 
A(m, k)A(n, &) and by Fekete’s lemma, we can define 


(11.22) O(k) := lim (A(n, k))¥”. 


N— OO 


From Theorem 11.3 and Theorem 11.7, we know that A(n,2) = 2 
and A(n,3) > 6- (#)""%. From (11.19), we have seen above that 
\(n,k) > n!(£)". We have also seen that there is a connection 
between permanents and SDRs. We now show a proof in which 
this connection is exploited. 


Theorem 11.8. \(n,k) < k?"/("). 

PROOF: We denote by P,,;, the collection of all ordered partitions 
of the set {1,2,...,nk} into classes of size k. We have 

(nk)! 

(hl)? 7 


(11.23) Pak = |Prkl = 


Now let A := (Aj,...,An) be such a partition. The number of 
SDRs of the subsets A;,...,A, is k”. Consider a second partition 
B := (B,,...,B,). We denote by s(.A,8) the number of common 
SDRs of A and B. We define an n x n matrix A with entries aj; 
by aj; := |A;NB;|. The point of the proof is the fact that per A 
counts the number of common SDRs of A and Bb. Furthermore, by 
definition of the partitions, the matrix A is in A(n,k). Therefore 


s(A,B) = per A> X(n,k). 
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If A € Py» is given and some SDR of A is given, then there are 
n!pn~—1 ordered partitions B that have this same SDR. Hence we 
have 

a S(A, By =k" on! papi: 


BEePn x 


Combining this with (11.23) and the inequality for \(n,k), we find 


kn . nIDy k-1 ae 
A(n, k) < —— = : 
(n ) Pnk () 


n 


This proof is due to Schrijver and Valiant (1980) who also gave 
the following corollary. 


Corollary. 6(k) < “2. 


PROOF: This follows in the usual way from the previous theorem 
by using Stirling’s formula: n! ~ n"e~"(270n)"/?. 


The corollary combined with Theorem 11.7 gives us one more 


value of 6(k), namely 6(3) = §. 


Problem 11C. Consider the set of integers 1,2,...,64. We first 
remove the integers = 1 (mod 9), ie. x; = 1,...,23 = 64. Then 
we remove the integers x; + 8, where 72 is to be interpreted as 
8. This leaves us with a set S of 48 elements. We partition S 
into subsets A;,...,Ag and also into subsets B,,..., Bs, where A; 
contains integers in the interval (8(7 — 1), 87] and B; contains the 
integers = i (mod 8). How many common SDRs are there for the 
systems Aj,...,Ag and By,..., Bs? 


Problem 11D. We return to Problem 5G. Again, consider a bi- 
partite graph on 2n vertices that is regular of degree 3. Give a 
lower bound for the number of perfect matchings. 


Problem 11E. Ona circular arrangement of the integers 1,2,...,n 
consider the subsets {7,7 + 1,1 +2} where i = 1,2,...,n and inte- 
gers are interpreted mod n. How many SDR’s does this collection 
of sets have? 
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Notes. 

In his book Permanents, H. Minc (1978) mentions that the name 
permanent is essentially due to Cauchy (1812) although the word as 
such was first used by Muir in 1882. Nevertheless, a referee of one of 
Minc’s earlier papers admonished him for inventing this ludicrous 
name! For an extensive treatment of permanents, we refer to Minc’s 
book. There, one can find much of the theory that was developed 
mainly to solve the Van der Waerden conjecture (without success 
at the time of writing of the book). 

Theorem 11.2 is from Ryser (1963). 

For a number of results related to the Minc conjecture, we refer 
to Van Lint (1974). 

The lemma known as Fekete’s lemma occurs in Fekete (1923). 
For another application, we refer to J. W. Moon (1968). 

The term doubly stochastic matrix can be motivated by consid- 
ering the entries to be conditional probabilities. However, perma- 
nents do not seem to play a role of importance in probability theory. 

The remarks concerning m(k) and M(k) preceding Theorem 11.7 
are based on Stirling’s formula and the related inequality n! > 
ne". This inequality is easily proved by induction, using the fact 
that (1+n~')” is increasing with limit e. Actually Stirling’s formula 
was first given by de Moivre. Stirling derived an asymptotic series 
for the gamma function which leads to the estimate 


Pin) a” 2¢e-* (Qn) 2 ¢8/(122) 


where 0 < @< 1. (n! =T(n+1).) 
A. Schrijver (1998) has established equality in the Corollary to 
Theorem 11.8 by proving that 


ana) > (SPY 
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The Van der Waerden 
conjecture 


In this chapter, we denote the set of all doubly stochastic matrices 
of size n x Q,. The subset consisting of matrices for which all 
entries are positive is denoted by 0*. We define J, := n~'J, where 
J denotes the n x n matrix for which all entries are 1. The vector 
(1,1,...,1)' is denoted by j. 

In 1926, B. L. van der Waerden proposed as a problem to deter- 
mine the minimal permanent among all doubly stochastic matrices. 
It was natural to assume that this minimum is per J, = n! n~” (as 
stated in (11.19)). The assertion 


(12.1) (A EQ, and A J,) => (per A > per Jn) 


became known as the ‘Van der Waerden conjecture’ (although in 
1969 he told one of the present authors that he had not heard this 
name before and that he had made no such conjecture). In 1981 two 
different proofs of the conjecture appeared, one by D. I. Falikman, 
submitted in 1979, and one by G. P. Egoritsjev, submitted in 1980. 
We shall give our version of Egoritsjev’s proof which had a slightly 
stronger result than Falikman’s, cf. Van Lint (1981). 

In the following, we shall use the term minimizing matrix for a 
matrix A € 2, such that per A = min{per S$: S €,}. As usual, 
the matrix obtained from A by deleting row 7 and column j is 
denoted by A;;. We often consider A as a sequence of n columns and 


write A = (a,,...,a,). Later on, we shall consider permanents of 
matrices of size n—1 but we wish to use the notation for matrices of 
size n. The trick is to write per (aj,...,a,_1,e;), where e; denotes 


the j-th standard basis vector. This permanent does not change 
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value if the 7-th row and n-th column are deleted. We remind the 
reader that by Problem 5C (Birkhoff), the set 2, is a convex set 
with the permutation matrices as vertices. 

We need a few elementary results on matrices in 2,,. The first 
statement is the same as Theorem 5.4. 


Theorem 12.1. If A is ann x n matrix with nonnegative entries, 
then per A = 0 if and only if A contains an s x t zero submatrix 
such thats+t=n-+1. 


We shall call an n x n matrix partly decomposable if it contains 
ak x n—k zero submatrix. So A is partly decomposable if there 
exist permutation matrices P and Q such that 


PAQ = iG De 


where B and D are square matrices. If a matrix is not partly 
decomposable, then we shall say that it is fully indecomposable. If 
A€éQ, and A is partly decomposable, then, in the representation 
given above, we must have C' = O, because the sum of the entries 
of B equals the number of columns of B and the sum of the entries 
of B and C is equal to the number of rows of B. So in that case, 
A is the direct sum B+ D of an element of 0; and an element of 
Opps. 


Problem 12A. Let A be ann x n matrix with nonnegative entries 
(n > 2). Prove that A is fully indecomposable if and only if per 
Aj; > 0 for all i and 7. 


Problem 12B. Let A be an n Xx n matrix with nonnegative en- 
tries. Prove that if A is fully indecomposable, then AA' and A'A 
are also fully indecomposable. 


Theorem 12.2. A minimizing matrix is fully indecomposable. 


PrRooF: Let A € 2, be a minimizing matrix and suppose that A 
is partly decomposable. Then, as we saw above, A = B+C, where 
BeQ, and C € Q,-;. By Theorem 12.1, we have per Aj441 = 0 
and per A414 = 0. By Birkhoff’s theorem, we may assume that B 
and C' have positive elements on their diagonals. In A we replace 
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bre by ber — € and ci, by cy, — € and we put an « in the positions 
kjk +1andk+1,k. The new matrix is again in (),, if € is small 
enough. The permanent of the new matrix is equal to 


per A—eper Aj, —eper Agsi gy + O(e). 


Since per Ajj; and per Aj+1,441 are both positive, this new perma- 
nent is smaller than per A if € is sufficiently small. This contradic- 
tion proves the assertion. 


Corollary. (i) A row of a minimizing matrix has at least two pos- 
itive entries. 

(ii) For any a,j; in a minimizing matrix, there is a permutation 0 
such that o(1) =j and a,,(3) > 0 forl<s<n,s i. 


PROOF: Clearly (i) is trivial, and (ii) follows from Problem 12A. 


Let us now look at how far we can get with calculus. A very 
important step in the direction of a proof of (12.1) is the following 
surprising result due to Marcus and Newman (1959). 


Theorem 12.3. If A € Q, is a minimizing matrix and ap, > 0, 
then per Ap, = per A. 


PROOF: Let S' be the subset of 2, consisting of the doubly stochas- 
tic matrices X for which x;; = 0 if aj; = 0. Then A is an interior 
point of the set S, which is a subset of R” for some m. If we denote 
the set of pairs (7,7) for which a;; = 0 by Z, we can describe S by 
the relations: 


xi; > 0, t9 = lyeesgtt} 
vij=0, (4,7) € Z. 


Since A is minimizing, the permanent function has a relative mini- 
mum in the interior point A of the set S and we can use Lagrange 
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multipliers to describe the situation. So we define: 


F(X) := per X -SoA (So — 7 _ So uy (Son — : : 
i=1 k=l j=l k=1 
For (i, 7) € Z, we have: 
It follows that per Aj; = A; + uw; and from this we find that for 
l<i<n 
(12.2) per A= S— ajjper A= y Cipla fy) = AF le 
j=l j=l 
and similarly for 1 <j <n, 
(19.3) per A= pj + S- aij ri 
i=1 


We introduce the vectors \ = (A1,.--,An)' and p= (p1,.--5 bn). 
From (12.2) and (12.3), we find 


(12.4) (per A)j=At+Au=pyt+A'X. 
Multiplying by A! gives us 
(per A)j= A'A+ Al Ap, 


and hence p = A! Ap, and similarly \ = AA'). The matrices AA! 
and A'A are both in Q, and by Problem 12B and Theorem 12.2, 
they have eigenvalue 1 with multiplicity one corresponding to the 
eigenvector j. So we see that both and yw are multiples of j. By 
(12.4), we have \; +; = per A and since per Aj; = 4+ 14j, we are 
finished. 
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Remark. It was shown by Marcus and Newman that Theorem 
12.3 implies that a minimizing matrix in 2% must be J,. The proof 
depends on the following idea. Let A be an element of 2, with 
the property that per Aj; = per A for all h,k. If we replace any 
column of A by a vector x for which )>;"_, xj; = 1, then the value of 
the permanent does not change (by (11.6)). We shall refer to this 
idea as the substitution principle. If A is a minimizing matrix in QF, 
then the substitution principle allows us to replace any two columns 
of A by their average and thus obtain a new minimizing matrix. In 
this way, one constructs a sequence of minimizing matrices which 
tends to J,. The uniqueness of the minimum takes a little extra 
work. 


A final result that uses ideas from calculus is the following gen- 
eralization of Theorem 12.3 due to London (1971). 


Theorem 12.4. If A € Q,, is a minimizing matrix, then per Aj; > 
per A for alli and j. 


Proor: Let 7 and j be given. By Corollary (ii) of Theorem 12.2, 
there is a permutation o such that o(i) = j and a,,(s) > 0 for 
1<s<n,s i. Let P be the corresponding permutation matrix. 
For 0 < 6 < 1 we define f(@) := per ((1—@)A+06P). Since Aisa 


minimizing matrix, f’(0) > 0, i.e. 


n n nm 


0< S- So (-a5j + p;;)per Aj; = —nper A+ Y= per Agate): 


i=1 j=l s=1 


By Theorem 12.3 we have per A,.,(s) = per A for s £7 and therefore 
per A;; > per A. 


Problem 12C. Show that Theorem 12.3 implies that if A € QF 
is a minimizing matrix, then there is a minimizing matrix B € QF 
that has aJ of size 4 as a principal submatrix. Then show that a 
must be : 


We now come to the main tool in the proof of the Van der Waer- 
den conjecture. This time we need linear algebra. We shall give a 
direct proof of a theorem on symmetric bilinear forms which leads 
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to the inequality that was derived by Egoritsjev from the so-called 
Alexandroff-Fenchel inequalities (which we do not treat). 

Consider the space R” with a symmetric inner product (x,y) = 
x'Qy. If Q has one positive eigenvalue and n — 1 negative eigen- 
values, we shall speak of a Lorentz space. We use the following 
standard terminology: a nonzero vector x is isotropic if (x,x) = 0, 
positive, respectively negative, if (x, x) is positive, respectively neg- 
ative. 

Every nonzero vector in the (n — 1)-dimensional span of the neg- 
ative eigenvectors is negative, so if a is positive and b is not a 
scalar multiple of a, then the plane spanned by a and b must con- 
tain a negative vector. Therefore the quadratic form in A given by 
(a+ Ab,a+ Ab) must have a positive discriminant. Thus we have 
the following inequality, which is like the Cauchy inequality but the 
other way around. 


Theorem 12.5. Ifa is a positive vector in a Lorentz space and b 
is arbitrary, then 
(a, b)* > (a,a)(b, b) 


and equality holds if and only if b = Xa for some constant X. 


The connection with permanents is provided by the following 
definition. Consider vectors aj,...,A@,—2 in R” with positive coor- 
dinates. As usual, let e;,...,e, be the standard basis of R”. We 
define an inner product on R” by 


(12.5) (x, y) = per (ay, A2,---,An—2,X, y), 


1.e. 

where Q is given by 

(12.6) Gij -= per (aj, ao,...,An—2, Gj, ej). 

Note that if A is a matrix with columns aj,...,a, and we delete 


the last two columns and the rows with index i and 7, then the 
reduced matrix has permanent equal to qi;. 
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Theorem 12.6. The space R” with the inner product defined by 
(12.5) is a Lorentz space. 


PROOF: The proof is by induction. For n = 2, we have Q = & i) 


and the assertion is true. Now assume the theorem is true for R"~!. 
In the first step of the proof, we show that @ does not have the 
eigenvalue 0. Suppose Qc = 0, i.e. 


(12.7) per (aj,...,ap—2,€,e;) = 0 for lL < 4 <7. 


By deleting the last column and the j-th row, we can consider (12.7) 
as a relation for vectors in R"-'. We consider the inner product 
given by 


(12.8) per (a1,---,An—3, X,Y, ej) jn 


and apply the induction hypothesis, (12.7) and Theorem 12.5. Sub- 
stitution of x = an_2, y = ap_2 in (12.8) gives a positive value, and 
X = an_2, Y = C gives the value 0. Therefore 


(12.9) per (aj,...,An—3,¢,¢,e;) < 0 forl<j<n 


and for each 7 equality holds if and only if all coordinates of c 
except c; are 0. If we multiply the left-hand side of (12.9) by the 
j-th coordinate of a,_2 and sum over 7, we find c'Qc. Therefore 
the assumption Qc = 0 implies that c = 0. 

For 0 < 6 < 1, we define a matrix Q» by taking (12.5) and 
replacing every a; by 0a; + (1 — 0)j. From what we have shown 
above, it follows that for every @ in [0,1] the matrix Qg does not 
have the eigenvalue 0. Therefore the number of positive eigenvalues 
is constant. Since this number is one for @ = 0, it is also one for 
? = 1, which proves our assertion. 


We formulate the combination of Theorem 12.5 and Theorem 
12.6 as a corollary. (The final assertion follows by continuity.) 


Corollary. Ifa,,...,a,—; are vectors in R” with positive coordi- 
nates and b € R", then 


(per (aj,...,@p—1,b))* 
> per (aj,..-,@n—1,An-1) + per (a1,.--,an—2, b, b) 
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and equality holds if and only if b = Aa,_, for some constant X. 
Furthermore, the inequality also holds if some of the coordinates 
of the a; are 0, but the assertion about the consequence of equality 
then cannot be made. 


We are now able to generalize Theorem 12.3. 


Theorem 12.7. If A €Q,, is a minimizing matrix, then per Ajj = 
per A for alli and j. 


PROOF: Suppose that the statement is false. Then by Theorem 
12.4, there is a pair r,s such that per A,, > per A. 

Choose t such that a,; > 0. Consider the product of two factors 
per A. In the first of these we replace a, by a;, and in the second, 
we replace a; by a;. Subsequently, we develop the first permanent 
by column s and the second permanent by column t. By Theorem 
12.5 and the Corollary to Theorem 12.6, we have 


(per A)? (> Art Per Au (>: Aks per sue . 
k=1 


By Theorem 12.4, every subpermanent on the right-hand side is at 
least per A and per A,;, > per A. Since per A;, is multiplied by 
a,- Which is positive, we see that the right-hand side is larger than 
(per A)’, a contradiction. 


We now use the substitution principle as follows. Take a mini- 
mizing matrix A and let u and v be two columns of A. Replace u 
and v by 5(u +v). The new matrix is again a minimizing matrix 
by Theorem 12.7. 

Let A be any minimizing matrix and let b be any column of A, 
say the last column. From Corollary (i) to Theorem 12.2, we know 
that in every row of A there are at least two positive plemente: We 
now apply the substitution principle (as sketched above) a num- 
ber of times but we never change the last column. In this way, 
we can find a minimizing matrix A’ = (aj,...,a/,_,,b) for which 
aj,...,a/_, all have positive coordinates. Now apply the Corollary 
to Theorem 12.6. By the substitution principle, equality must hold. 
Hence b is a multiple of a; for any i with 1 < i <n—1. This implies 
that b = n~'j and therefore A = n~!J,, which completes the proof 
of the Van der Waerden conjecture. 
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Theorem 12.8. The implication (12.1) is true. 


Notes. 

For a survey of the two proofs of the Van der Waerden conjec- 
ture, we refer to Van Lint (1982). There one also finds historical 
comments and a nice anecdote concerning the conjecture. 

B. L. van der Waerden (1903-1996), a Dutch mathematician, 
was known mostly for his work in algebra, although he published 
in several fields. His work Moderne Algebra (1931) set the trend 
for many decades. 

Minc’s book on permanents is the best reference for all the work 
that was done in relation to the Van der Waerden conjecture up to 
1978. 

The name Lorentz space is related to relativity theory and the 
group of transformations that leave the quadratic form x? + y? + 
2? —#? invariant. H. A. Lorentz was a Dutch physicist who won the 
Nobel prize for his work. 
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Elementary counting; 
Stirling numbers 


The next few chapters will be devoted to counting techniques and 
some special combinatorial counting problems. We start with sev- 
eral elementary methods that are used quite often. Consider map- 
pings from {1,2,...,n} to {1,2,...,4}. Their total number is k”. 
In Example 10.2, we studied the case where the mappings were 
required to be surjective. We return to this question in Theorem 
13.5. If the mappings are injections, then their number is the falling 
factorial 


(13.1) (k)n = k(k—-1)...(k—-n +1) =kY/(K—n)I. 


We now consider a similar problem. The n objects to be mapped 
are no longer distinguishable but the images are. We formulate this 
as follows. We have n indistinguishable balls that are to be placed 
in k boxes, marked 1,2,...,/. In how many different ways can this 
be done? The solution is found by using the following trick. Think 
of the balls as being colored blue and line them up in front of the 
boxes that they will go into. Then insert a red ball between two 
consecutive boxes. We end up with a line of n+-k—1 balls, k—1 of 
them red, describing the situation. So the answer to the problem 


is Ce): We formulate this as a theorem. 


Theorem 13.1. The number of solutions of the equation 


(13.2) cp Ppegtss: aan 


n+k— i) ; 


in nonnegative integers is ( By 


PRooF: Interpret x; as the number of balls in box 27. 
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Corollary. The number of solutions of the equation (13.2) in pos- 
itive integers is aa 

PROOF: Replace x; by y; := 2; —1. Then SJy; = n—k. Apply 
Theorem 13.1. 


Example 13.1. By analogy with the question we encountered in 
Example 10.6, we consider the problem of selecting r of the integers 
1,2,...,n such that no two selected integers are consecutive. Let 
Ly < X22 < +--+ < x, be such a sequence. Then 2; > 1, x2 — 2 > 
2,...,L, — Lp_1 > 2. Define 


Y= 21, Y= ey —-H1—-1, 25057, Yui n—a2a, +i. 


Then the y; are positive integers and eae yi =n—r+2. By the 
Corollary to Theorem 13.1, we see that there are Ca) solutions. 


Problem 13A. On a circular array with n positions, we wish to 
place the integers 1, 2,...,7 in order, clockwise, such that consecu- 
tive integers, including the pair (r,1), are not in adjacent positions 
on the array. Arrangements obtained by rotation are considered 
the same. In how many ways can this be done? 


Example 13.2. In how many ways can we arrange 1 balls of color 
1, ro balls of color 2,...,7; balls of color k in a sequence of length 


nmi=ry+rot---+r,? If we number the balls 1 to n, then there are 
n! arrangements. Since we ignore the numbering, any permutation 
of the set of r; balls of color 7, 1 < i < k, produces the same 
arrangement. So the answer to the question is the multinomial 
coefficient ("| ); see (2.1). 


T1 5k 
Example 13.3. We wish to split {1,2,...,n} into b; subsets of size 
1, bo subsets of size 2, ..., by, subsets of size k. Here pe 1b; =n. 
The same argument as used in Example 13.2 applies. Furthermore, 
the subsets of the same cardinality can be permuted among them- 
selves without changing the configuration. So the solution is 


n! 
by! .. Dp t(1!)o2 (2!)2 2. (RY) be” 


(13.3) 


Several counting problems (often involving binomial coefficients) 
can be done in a more or less obvious way that leads to a (sometimes 
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difficult) calculation. Often there is a less obvious ‘combinatorial’ 
way to do the counting that produces an immediate answer. We 
give a few examples. 


Example 13.4. Let A run through all subsets of {1,2,...,n}. 
Calculate S = 5>|A|. Since there are (") subsets of size i, we 
apparently must calculate $7/)i(/). By differentiating (1 + x)”, 


i 
we find 
n 


S- i (") go anl+a)"" 
i 
i=1 
and substitution of x = 1 yields the answer S = n-2"~'. If we had 
spent a little more time thinking, then this answer would have been 
obvious! A set A and its complement together contain n elements 
and there are exactly 2”~! such pairs. 


Example 13.5. In Chapter 10 we saw some examples of formulae 
involving binomial coefficients for which a combinatorial proof was 
easier than a direct proof. The familiar relation 


a ¥() -@) 


is another example. Of course one can calculate this sum by de- 
termining the coefficient of x” in (1+ x7)"(1+ x)" and using the 
binomial formula. However, each side of (13.4) just counts (in two 
ways) the number of ways of selecting n balls from a set consisting 
of n red balls and n blue balls. 


Problem 13B. Show that the following formula for binomial co- 
efficients is a direct consequence of (10.6): 


n+1 — (k\ (n—k 
Go) Caw 
k=0 
Give a combinatorial proof by considering (a + b + 1)-subsets of 


the set {0,1,...,n}, ordering them in increasing order, and then 
looking at the value of the integer in position a+ 1. 


Example 13.6. We consider a slightly more complicated example 
where our previous knowledge could lead us to an involved solu- 
tion. How many sequences A;,...,A, are there for which A; C 
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(hooey Ss Keane (ese A; = {1,2,...,n}? Since we wish 
to avoid that 7, 1 <j <n, is not an element of the union of the 
A;’s, we are tempted to use inclusion-exclusion. If we choose 7 ele- 
ments from {1,2,...,n} and consider all sequences Aj,..., A, not 
containing any of these i elements, then we find (2”~')* sequences. 
So by Theorem 10.1, the solution to the problem is 


y-0' Cr = (ak — 1)", 


i=0 


This answer shows that another approach would have been better. 
If we describe a sequence Aj,..., A, by a (0,1)-matrix A of size 
k x n, with the characteristic functions of the subsets as its rows, 
then the condition on the sequences states that A has no column 
of zeros. So there are (2* — 1)" such matrices! 


Problem 13C. Give a solution involving binomial coefficients and 
a combinatorial solution to the following question. How many pairs 
(A, Aa) of subsets of {1,2,...,n} are there such that AM Ag = 0? 


Problem 13D. Consider the set S of all ordered k-tuples A = 
(A,,..., Ax) of subsets of {1,2,...,n}. Determine 


S "Ar U Ag U +++ U Ag. 
Aes 


Problem 13E. The familiar relation 
1 
m=k k k + 1 


is easily proved by induction. The reader who wishes to, can find a 
more complicated proof by using (10.6). Find a combinatorial proof 
by counting paths from (0,0) to (J+1,k+1) in the X-Y plane where 
each step is of type (x,y) — (a+ 1,y) or (2,y) — (wa + 1,y +1). 
Then use the formula to show that the number of solutions of 


Ly tXet--- +e, <n 
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in nonnegative integers is (ors Can you prove this result combi- 
natorially? 


It is often possible to prove relations between binomial coeffi- 
cients by considering () formally as a polynomial in a defined by 


@ . Cia alia a! 


If two polynomials of degree < k agree for k + 1 values of the 
variable, they are identical. We give an example. 
Let 


Fay =D (Aaty Ge => ("2 )Cae ry 


k=0 k=0 


From the binomial theorem we know that if a is an integer in the 
interval [0,n], then F(a) = (x + y)*y""* but so is G(a), again by 
the binomial theorem. So, the polynomials are identical and we 
can substitute any numbers for a,x, and y to obtain relations for 
binomial coefficients. For instance, y = 2x and a = 2n + 1 yields: 


. ee ‘) gn-k = 3 (" : ‘) Brek, 


k=0 k=0 


a relation that would be extremely difficult to prove by counting. 

Two kinds of numbers that come up in many combinatorial prob- 
lems are the so-called Stirling numbers of the first and second kind. 
The numbers are often defined by the formulae (13.8) and (13.12) 
given below. We prefer a combinatorial definition. 

Let c(n,k) denote the number of permutations 7 € S,, with ex- 
actly k cycles. (This number is called a signless Stirling number of 
the first kind.) Furthermore define c(0,0) = 1 and c(n,k) = 0 if 
n<0Oork <0, (n,k) 4 (0,0). The Stirling numbers of the first 
kind s(n,k) are defined by 


(13:5 } s(n, k) := (—1)""*e(n, k). 
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Theorem 13.2. The numbers c(n,k) satisfy the recurrence rela- 
tion 


(13.6) c(n,k) = (n—1)c(n—1,k) +e(n—1,k -1). 


PrRooF: If 7 is a permutation in S,_, with k cycles, then there 
are n — 1 positions where we can insert the integer n to produce 
a permutation 7’ € S,, with k cycles. We can also adjoin (n) as a 
cycle to any permutation in S,_; with k — 1 cycles. This accounts 
for the two terms on the right-hand side of (13.6). 


Theorem 13.3. For n > 0 we have 


n 


(13.7) Si e(n,k)a* = a(@+1)...(2+n-1) 
k=0 
and 
(13.8) s(n, k)a* = (x)n, 
k=0 


where (x), is defined as in (13.1). 
PROOF: Write the right-hand side of (13.7) as 


F, (a) = S— b(n, k)2*. 
k=0 


Clearly b(0,0) = 1. Define b(n, k) := 0 ifn <0 ork <0, (n,k) 4 
(0,0). Since 


F(a) = (2 +n — 1) Fy-1(2) 
=) o(n-1,k-1)x* + (n-1) So do(n-1,k) ak, 


k=1 


it 
Co 


we see that the numbers b(n, k) satisfy the same recurrence relation 
as the c(n,k), namely (13.6). Since the numbers are equal if n < 0 
or k < 0, they are equal for all n and k. 
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To prove (13.8) replace « by —x and use (13.5). 


We remark that it is possible to give a combinatorial proof of 
(13.7) by showing that both sides of the equation count the same 
objects. 

We now define the Stirling numbers of the second kind: denote 
by P(n,k) the set of all partitions of an n-set into k nonempty 
subsets (blocks). Then 


(13.9) S(n,k) := |P(n, k)|. 


Again we have $(0,0) = 1 and take the numbers to be 0 for all 
values of the parameters not covered by the previous definition. 
Again we have an easy recurrence relation. 


Theorem 13.4. The Stirling numbers of the second kind satisfy 
the relation 


(13.10) S(n,k) = kS(n-—1,k)+S(n-—1,k—-1). 


ProoF: The proof is nearly the same as for Theorem 13.3. A 
partition of the set {1,2,...,n— 1} can be made into a partition 
of {1,2,...,n} by adjoining n to one of the blocks or by increasing 
the number of blocks by one by making {n} a block. 

We define the Bell number B(n) to be the total number of parti- 
tions of an n-set, i.e. 


(13. E1) Bia = So Sts k), (1). 
k=1 


For the Stirling numbers of the second kind there is a formula 
similar to (13.8). 


Theorem 13.5. For n > 0 we have 
(13.12) m= S. G(n,k) (oe. 
k=0 


PrRooF: We first remark that by (13.9) the number of surjective 
mappings from an n-set to a k-set is k!S(n,k) (a block of the parti- 
tion is the inverse image of an element of the k-set). So by Example 
10.2, we have 
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(13:13)<S. ni) = BED ( k-i)" = 1 (aay Ce 


Now let x be an integer. There are x” mappings from the n-set 
N := {1,2,...,n} to the z-set {1,2,...,2}. For any k-subset Y of 
{1,2,...,a}, there are k!S(n,k) surjections from N to Y. So we 
find 


In Example 10.1, we saw that it can be useful to associate a so- 
called generating function with a sequence a1, d9,... of numbers. 
In Chapter 14, we will see many uses of the generating function. 
At this point we treat the generating functions for the Stirling 
numbers. 


Theorem 13.6. )>,,5;,5(n,k)= = Ale" — 1) (hk = 0): 


Proor: Let F(x) denote the sum on the left-hand side. By (13.10) 
we have 


Fy(x) = kFi,(x) + Fy_i(2). 


The result now follows by induction. Since S(n,1) = 1, the as- 
sertion is true for k = 1 and the induction hypothesis yields a 
differential equation for Fi, which with the condition S(k,k) = 1 
has the right-hand side of the assertion as unique solution. 


For the Stirling numbers of the first kind, it is slightly more 
difficult to find the generating function. 


Theorem 13.7. 0, s(n, k)= = §(log(1 + z))*. 


PROOF: Since 


; 1 
(1+ z)* = er attts) = >) _qilog(l + 2)) "a 
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the right-hand side in the assertion is the coefficient of x* in the 
expansion of (1 + z)*. On the other hand, we have for |z| < 1, 


aa a 
(1 ye = : ( ye = : — (2) nz" 
n nl 
n=0 n=0 
CO n n CO CO n 
z z 
= ) — )_ s(n, ra" = ) a” )_s(n,r)— 
n=0 te r=0 r=0 n=P 


This completes the proof. 


Problem 13F. Show directly that the number of permutations of 
the integers 1 to n with an even number of cycles is equal to the 
number of permutations with an odd number of cycles (n > 1). 
Also show that this is a consequence of Theorem 13.7. 


Finally we mention that the Stirling numbers of the first and 
second kind are related by 


(13.14) S— S(n,k)s(k,m) = dnn- 

k=m 
This follows immediately if we substitute (13.8) in (13.12). Since 
the functions x”, respectively (x)n, with n > 0 both form a basis 
of the vector space C[x], the formula (13.14) is just the standard 
relation between the matrices for basis transformation. 


Problem 13G. Show that (13.12) leads to B(n) = +70, 5 


~~ e ke 
Problem 13H. Let A be the n x n matrix with a;; := (;) for 
i,j =0,...,2—1. Determine A7!. 
Problem 131. From (10.6) one finds, by taking « = s, that 
ae (Ss )23 = 2*+!. Using a method from this chapter prove 


J=O0\ j 
v3 i 
J 


that 
j=0 


Prove the same result directly by defining 


an = 3 (" ’) aes, 
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substituting the basic recurrence for binomial coefficients, thus find- 
ing that a, = $Qn - FAn—1. 


Problem 13J. Suppose we call a set nice if its cardinality is di- 
visible by 3. How many nice subsets does an n-set have? 


Problem 13K. Prove that }>°, S(n,n — 2)a" = ae 


Notes. 

Stirling numbers of the first and second kind appear in many 
areas of mathematics; for example, they play a role in a number 
of interpolation formulae and in the calculus of finite differences. 
There are tables of the numbers in several books on tables of math- 
ematical functions. 

Later, in Chapter 37, these numbers will reappear. 

James Stirling (1692-1770), a Scottish mathematician, studied 
at Oxford. He taught mathematics at Venice and London but 
switched to a career in business at the age of 43. 
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Recursions and 
generating functions 


Many combinatorial counting problems with a solution a, depend- 
ing on a parameter n, can be solved by finding a recursion relation 
for a, and then solving that recursion. Sometimes this is done by 
introducing an ordinary generating function 


fe) = ae 


n>0 


or an exponential generating function 


f(z) = an, 


n>0 


and using the recursion to find an equation or a differential equation 
for f(x), and solving that equation. We shall demonstrate several 
of the techniques involved. 


Example 14.1. As an introduction, consider once again Example 
10.1. Let a be a derangement of {1,2,...,2 +1}. There are n 
choices for m(n + 1). If t(n +1) =i and a(t) =n +1, then 7 is 
also a derangement on the set {1,2,...,n}\{i}. If 7(n+1) =i and 
m(t) An+1=n7(J), then replacing 7(j) by 7 yields a derangement 
on the set {1,2,...,n}. Therefore 


(14.1) diay = n(dn ar dn—i), 


which is also an immediate consequence of (10.3). Let D(a) be 
the exponential generating function for the sequence dy = 1,d, = 
0, dz,.... From (14.1) we immediately find 


(1—2)D'(x) = xD(z2), 
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and from this we find D(x) = e~*/(1 — x) and (10.2). 


In many cases, we use the generating functions only as a book- 
keeping device, and our operations of addition, multiplication (and 
even substitution and derivation, as we shall see below) are to be 
interpreted formally. It is possible to give a completely rigorous 
theory of formal power series (as algebraic objects) and we give 
an introduction to this theory in Appendix 2. In most cases, it is 
intuitively clear and easy to check that the operations are legiti- 
mate. If the series that we use actually converge, then we can use 
all appropriate knowledge from analysis concerning these series, as 
we did in Example 14.1. We give another elementary example. 


Example 14.2. Suppose that we have k boxes numbered 1 to k 
and suppose that box 7 contains r; balls, 1 <i< k. A formal 
bookkeeping device to list all possible configurations is to let the 


named one correspond to the term 2x5? ---x,* in the product 


(l+a, +22 +---)(1+¢o+a3-+-+-)--+( + ay +22 +--+). 


We can collect all the terms involving exactly n balls by taking 
x; = x for alli, and considering the terms equal to x”. Therefore we 
find that the number of ways to divide n balls over k distinguishable 
boxes is the coefficient of 2” in the expansion of (1 — x)~", and by 
(10.6) this is (oo) giving a second proof of Theorem 13.1. 


n 


In many cases, the combinatorial problem that we are interested 
in leads to a linear recursion relation with constant coefficients, 
which is easily solved by standard methods. 


Example 14.3. We consider paths of length n in the X-Y plane 
starting from (0,0) with steps R: (#,y) - (a@+1,y), L: (a,y) 
(x—1,y), andU : (x,y) — (a, y+1) (i.e. to the right, to the left, or 
up). We require that a step R is not followed by a step L and vice 
versa. Let a, denote the number of such paths. First observe that 
if we denote by 6, the number of paths of length n starting with a 
step U, then b, = a,_, and furthermore trivially by1 > byb» and 
b, < 3"-1. So by Fekete’s lemma, Lemma 11.6, limy. bn!” exists 
and is at most 3. Next, note that aj = 1 and a, = 3. We split 
the set of paths of length n into subsets depending on the last one 
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or two steps. Clearly there are a,_; paths ending with the step U. 
Take a path of length n —1 and repeat the last step if it is Z or R, 
and adjoin a step L if the last step was U. In this way, we obtain 
all the paths of length n that end in LL, RR, or UL. So there are 
Gn—1 Of these. It remains to count the paths ending with UR and 
again it is trivial that there are a,_2 of these. We have shown that 


an = 20-4 + Gn—2 (n 2 2). 
Let f(x) = 07.) a@nz”. Then the recursion implies that 
f(z) =14+ 3a + 2x (f(x) —1)+ 2’ f(x), 


1.e. 


Tp eae eee 


where a=1+ V2, GB =1- 2. Therefore 


f(@) 


1 
An = stan aie Ber 


and we find limp... ai/" = 1+ V2. 

Problem 14A. (i) Let a, denote the number of sequences of 0’s 
and 1’s that do not contain two consecutive 0’s. Determine ay. 
(ii) Let 6, denote the number of sequences of 0’s and 1’s with no 
two consecutive 1’s and for which a run of 0’s always has length 2 
or 3, including possibly at the beginning or end of the sequence. 
Show that bi! " _, © for some ¢ and approximate c. 


Example 14.4. Let a(r,n), where 0 <r <n, denote the number 
of solutions of the problem of Example 13.1 (a(0,0) = 1). We 
divide the set of possible sequences into two subsets: those with 
x; = 1 and those with x, > 1. The first subset clearly contains 
a(r — 1,n — 2) elements, the second one a(r,n — 1) elements. So 


(14.2) a(r,n) =a(r,n—1) +a(r —1,n—- 2) (n> 1). 
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From this recursion, we can prove the result a(r,n) = ("" eae by 
induction. Using the generating function is more difficult. Try 


From (14.2) we find 


fey) =1+e+ayt+ax(-14 f(z,y)) +2°yf(z,y), 


i.e. 


1l+2y gee rae 
DOWNS a Le ee yer 


Substitution of (10.6) for (1—x)~*! produces the required binomial 
coefficient for a(r,7). 


As we saw in Example 14.3 (and Problem 14A) a linear recursion 
with constant coefficients leads to a rational function as generating 
function (and vice versa). Indeed, if ay, = oe ok QAKAn—k (rn > 1) and 
F(x) = oP.5 @nz”, then (1 — Ss ,.-1 ane") f(a) is a power series for 
which the coefficient of x” is 0 ifn > l. 

The following example due to Klarner (1967) shows an inter- 
esting case of a linear recursion that is found with the help of the 
generating function and that is not at all obvious in a combinatorial 
way. 


Example 14.5. Consider configurations in the plane, called poly- 
ominoes, as in Fig. 14.1. The configuration consists of layers, each 
consisting of consecutive squares. Two consecutive layers meet 
along the edges of a number (> 1) of squares. (More precisely, 
we are considering horizontally convex polyominoes.) 


12 
10} 11 


No) 


1)2)3)4)]5 


Figure 14.1 
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Let a, denote the number of polyominoes with n squares and 
define f(x) := >, a,x". To find f, we introduce a(m,n) for the 
number of polyominoes for which the bottom layer has m squares 
(and a total of n). We define a(m,n) := 0 if m > n. Clearly 


(14.3) =) \(m+1-1a(l,n—m). 
l=1 


We define 


(14.4) Fey) = Ss S- alii. ney: 


n=1 m=1 


Then f(x) = F(a,1). Because the series will turn up below, we 


also define 
oS Se S- ma(m, n)x 


n=1 m=1 


We would like to write 


(14.5) ge) = Gs 


Even though we have a theory of formal power series, it may be 
instructive to show that the righthand side of (14.4) converges in 
a sufficiently large region. This gives us the opportunity to show a 
quick way of getting a rough estimate for a,. Number the squares 
of the polyomino in the obvious way as in Fig. 14.1. To each square 
associate a quadruple (20,21, 22,273) of 0’s and 1’s, where xp = 1 
means that there is a square of the polyomino below this square, 
x; = 1 means that there is a square of the polyomino to the left, 
x2 = 1 means that there is a square above, and 73 = 1 means 
that there is a square to the right. For example, in Fig. 14.1 the 
first quadruple is (0,0,1,1). The sequence of quadruples uniquely 
determines the polyomino (e.g. the fifth quadruple is the first one 
in the sequence that ends in a 0, showing that m = 5, etc.). This 
shows that a, < 15". From this and (14.3) we find a(m,n) < 
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n-15"~™ which is enough to justify (14.5). From (14.4) we find by 
substituting (14.3) and a straightforward calculation 


(14.6) Fig) = 


g(x). 


vy (xy)? vy 
aa (i — aye) + l—-2zy 


Differentiation of both sides of (14.6) with respect to y and taking 
y =1 yields (using (14.5)): 


x Dx? x 
(14.7) Oe): = Gap + ai ae + 


a — ays): 


From (14.7) we can find g(x) and substitute this in (14.6); then 
take y = 1, which yields 


x(1—-2)3 


ee) ie) 1 — 5x2 + 7x? — 4a" 


From (14.8) we see that a, satisfies the recurrence relation 
(14.9) Qn = 5A@n_1 — Tan—2 + 4An—3 (wes). 


As we remarked above, it is not at all clear how one could prove 
this directly; it has been done however. 


Remark. From (14.9) we find that lim, .. ai!” = 0, where 0 is the 


zero with largest absolute value of the polynomial x? — 5x? + 7x — 4 
(0 = 3.2). 


The following example produces a result that is important for the 
theory of finite fields. In this example, we combine a generalization 
of the idea of Example 14.2 with the method of formal operations 
with power series. The reader should convince herself that the 
operations with logarithms are correct without using convergence. 


Example 14.6. We shall count the number of irreducible polyno- 
mials of degree n over a field of gq elements. 

Number all the monic irreducible polynomials of degree at least 
one over a field of g elements: 


Fil@) fa Be Tala) ce 
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with, say, respective degrees d1, da, d3,... . Let Ng denote the num- 
ber of degree d, d= 1,2,3,... . 
Now for any sequence 21, 72, 73,... of nonnegative integers (all but 


finitely many of which are zero), we get a monic polynomial 


f(x) = (filx))" (fol)? (fa(a))* --. 


whose degree is n = 71d, + todo + i3d3 +--- . By unique factor- 
ization, every monic polynomial of degree n arises exactly once in 
this way. To repeat, there is a one-to-one correspondence between 
monic polynomials of degree n and sequences 21, 72,... of nonneg- 
ative integers satisfying n = 11d, + todo + igd3+--- . 

Of course, the number of monic polynomials of degree n is q”, 
i.e. the coefficient of x” in 


1 
=1+ qe + (qx) +(qz)?+---. 
1-— qx 
Clearly (cf. Example 14.2), the number of sequences 71, 7%2,... with 
n = 14d, + todg +... is the coefficient of x” in the formal power 


series 
(Lta® po24 4 93h 4... )(1pe2@ 4 g2 4 gi 4...) 


Thus we conclude 


1 -T] 1 Wf \™ 
l—qr aa: 1-24 ; 


i=1 


Recalling that log + = 2+ 527 +422 +---  , we take (formal) 
logarithms of both extremes of the above displayed equation to 
find 


Co CO 


ye a) OND 


n=1 = J 


git 


Then, comparing coefficients of x” on both sides, we get 


q” = 1 
a = dL Nee 


d\n 
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i.e. 


= Sl dN. 


d\n 


This last equation was our goal. We have derived it combinatori- 
ally from unique factorization, but we remark that it has in fact an 
elegant interpretation in terms of the factorization of x7” — x over 
the field of q elements. Applying Mébius inversion (cf. Theorem 
10.4), we obtain the following theorem. 


Theorem 14.1. For a prime power q, the number of monic irre- 
ducible polynomials of degree n over the field of q elements is given 


by 


1 n 
N,=— sgt. 
rs % u(5)4 


We note that a direct consequence of Theorem 14.1 is that Ng > 
0, that is, there exist irreducible polynomials of every degree d. 
This immediately leads to a proof of the existence of fields of p4 
elements for every prime p, without the usual expedient of reference 
to the algebraic closures of the prime fields. 

* OK OK 


We now come to a large class of counting problems, all with 
the same solution. The class is known as the Catalan family be- 
cause Catalan (1838) treated Example 14.7. Actually, the equiva- 
lent problem called Question 3 in Example 14.9 below was treated 
by von Segner and Euler in the 18th century. We denote the so- 
lutions to the problems by u,,. We shall show that u, = ten), 
These numbers are called Catalan numbers. 


Example 14.7. Suppose that we have a set S with a nonasso- 
ciative product operation. An expression 171%2...%, with 7; € S 
does not make sense, and brackets are needed to indicate the order 
in which the operations are to be carried out. Let u,, denote the 
number of ways to do this if there are n factors x;. For example, 
i= +(3) = 5 corresponding to the products (a(b(cd))), (a((bc)d)), 
((ab)(cd)), ((a(bc))d), and (((ab)c)d). Each product contains within 
the outer brackets two expressions, the first a product of m factors 
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and the second a product of n — m factors, where 1 <m<n-1. 
It follows that the numbers u,, satisfy the recurrence relation 


n-1 


(14.10) Un = UmUn—m (n z 2). 


m=1 


From (14.10) and u,; = 1, we find that the generating function 
f(x) := SO, unx” satisfies the equation 


m=1 


(14.11) =a+ 3 (& UmUn— n)2 "=a+(f(2))’. 


Solving the quadratic equation and taking into account that f(0) = 


0, we find 
1-—vV1—-4¢ 
f(x) = —S—. 


From the binomial series we then obtain 


(14.12) cae ( 7 a 


n\n-1 


The operations carried out above can all be justified as formal op- 
erations on power series, and the binomial series can also be treated 
formally without being concerned about convergence. If one feels 
uneasy about this approach, there are two remedies. The first is 
to find the solution as we did above and afterwards prove that it is 
correct by using (14.10) and induction. If one really wishes to be 
sure that the generating function is defined by a convergent power 
series, then use the result of the calculation to find some rough es- 
timate for u,, and prove that the estimate is correct using (14.11), 
and then show that the series converges. For example, we could 
try Un < c"!/n?. This is true for n = 1 and any positive c and it 
is true for n = 2 if c > 4. The induction step with (14.11) yields 


n-1 io 


tee Y ae a a oa! 


m=1 m=1 
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ite > om. So the radius of convergence of }>° , unx"” is positive. 


Remark. Formula (14.10) shows that un > UmUn—-m, and then 
Fekete’s lemma implies that lim,_.., ul!” exists. In fact, (14.12) 
now shows that this limit is 4. 


As we remarked above, the Catalan numbers reappear regularly 
in combinatorial counting problems. This has led to the question 
of proving the other results combinatorially, i.e. by showing that 
there is a one-to-one correspondence between the objects to be 
counted and those of Example 14.7. Some of these will be treated 
in Example 14.9. First we look at another famous counting problem 
that has the Catalan numbers as solution. 


Example 14.8. Consider walks in the X-Y plane where each step 
isU: (a,y) — (a@+1,y+1) or D: (x,y) — (4@+1,y—1). We start 
at (0,0) and ask in how many ways we can reach (2n,0) without 
crossing the X-axis. The solution of this problem uses an elegant 
trick known as André’s reflection principle (1887). In Fig. 14.2 we 
consider two points A and B in the upper halfplane and a possible 
path between them which meets and/or crosses the X-axis. 

By reflecting the part of the path between A and the first meeting 
with the X-axis (i.e. C in Fig. 14.2) with respect to the X-axis, we 
find a path from the reflected point A’ to B. This establishes a 
one-to-one correspondence between paths from A’ to B and paths 
from A to B that meet or cross the X-axis. 


Figure 14.2 


It follows that if A = (0,4) and B = (n,m), then there are Co 
paths from A to B that cross or meet the X-axis, where 2], := n— 
k—m. Since there are (;”) paths from A to B, where 2lz := n—m+k, 


lg 
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we find (7,) — (7') paths from A to B that do not meet the X-axis. 
Any path from (0,0) to (2n,0) in the upper halfplane that does not 
meet the X-axis between these points goes from (0,0) to (1,1):=A, 
from A to B := (2n — 1,1) without meeting the X-axis, and then 
from (2n — 1,1) to (2n,0). By the argument above, we find that 
there are u,, such paths. If we allow the paths to meet the X-axis 
without crossing, then there are u,4, such paths. 

We remark that the number of paths from (0,0) to (2n,0) in the 
upper halfplane that do not meet the X-axis between these points is 


equal to the number of sequences of zeros and ones (21, 22,..-, 2n) 
with 
1, : 
<5j forl<j<2n-1, 
(14.13) byte te te | J nae 
=i 1ong = 20. 


The correspondence is given by letting a 1 correspond to a step D 
of the path. 


We now show by combinatorial arguments that several counting 
problems lead to the Catalan numbers. 


Example 14.9. Consider the following three questions. 


Question 1. A tree that is drawn in the plane is called a plane 
tree. How many rooted plane trees are there with n vertices and a 
root with degree 1 (so-called planted plane trees)? 


Question 2. A planted plane tree is called trivalent or a binary 
tree if every vertex has degree 1 or 3. How many trivalent planted 
plane trees are there with n vertices of degree 1? 


Question 3. In how many ways can one decompose a convex 
n-gon into triangles by n — 3 nonintersecting diagonals? 


The first correspondence we shall show is between Question 2 
and Example 14.7. 
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For this, Fig. 14.3 suffices. 


(x1 ((t2%3)(%4%5))) S 


Figure 14.3 


It follows that the solution to Question 2 is u,_1. 


The correspondence between Question 1 and Example 14.8 is also 
seen from a figure, namely Fig. 14.4. 


S UUUDUDUDDUDD 


Figure 14.4 


In the figure, the tree can be described by a walk around it as 
shown by the dotted line. This walk can be described by calling 
the steps U for a step going up, and D for a step going down. This 
yields a sequence of twelve steps. Replacing U by 0 and D by 1 
yields a sequence that obviously satisfies (14.13). This shows that 
the solution to Question 1 is also un_}. 

Finally, we show a correspondence between Question 3 and Ques- 
tion 2. Consider an n-gon decomposed into triangles and distin- 
guish some edge. We now construct a tree as in Fig. 14.5. 
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~--O----- 
Figure 14.5 


The tree is a binary tree with vertices of degree 1 corresponding 
to the sides of the n-gon, and vertices of degree 3 corresponding to 
the triangles. The tree is planted at the vertex corresponding to 
the special edge. So again we find that the number of solutions is 
Un—1- 


Problem 14B. Describe (with a figure) a correspondence between 
the graphs of Question 1 and those of Question 2. 


Problem 14C. Find a direct one-to-one mapping from the n-gons 
of Question 3 to the products of Example 14.7. 


We now turn to several problems in which the exponential gen- 
erating function turns out to be useful. We shall give a somewhat 
more systematic treatment for this case. For a complete treatment 
of the methods, we refer to A. Joyal (1981). 

Let M denote a ‘type’ of combinatorial structure. For example, 
trees, polygons, sets (the ‘uniform’ structures), permutations, etc. 
Let mz be the number of ways of giving a labeled k-set such a 
structure. In each separate case we shall specify whether we take 
mo = 0 or mp = 1. We use capital letters for the structure and 
lower case letters for the counting sequence. We define 


2 k 
x 
k=0 


So if T denotes the structure (labeled) tree, then we know from 
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Theorem 2.1 that 


= 4-2" 
T(z) = Sok a 
k=0 
If S denotes the uniform structure (a set) then s, =1 for all & and 
therefore S(x) = e*. If C denotes ‘oriented circuit’, then we can 
start at vertex 1 and there are clearly (k — 1)! ways to proceed. 
Therefore C'(a) = —log(1 — 2). 

Suppose we wish to consider the number of ways a labeled n-set 
can be partitioned into two parts, one with a structure of type A 
and the other with a structure of type B. It is easy to see that the 
number of ways that we can do this is }°;_9 a apbn—,. It follows 
that if we call this a structure of type A- B, then 


(14.15) (A-B)(«) => (>: @ wher) — = A(x) - B(x). 
n=0 \k=0 
The reader should be careful with the result (14.15). The asser- 
tion is only true if, from the compound structure there is a unique 
way of finding the partition into the two parts that led to this struc- 
ture. For example, if A and B are the same, the result is clearly 
false. 


Example 14.10. Once again we consider derangements. Call this 
a structure of type D and let II denote the structure ‘permutation’. 
Clearly I(x) = (1—«)~!. Any permutation consists of a set of fixed 
points (that we interpret as just a set) and a derangement on the 
remaining points. So by (14.15) we have 


(1-2) '=D(a)-5(z), ie. D(z) =e7(1— 2) 
as was shown in Example 10.1. 


Example 14.11. In how many ways can a labeled n-set be split 
into a number of pairs (= P) and a number of singletons (= S)? 
First, observe that if we wish to split 2k points into pairs, we must 
choose a point x; to go with point 1 and then split the remaining 
2k — 2 points into pairs. Therefore po; = (2k —1)!! and we find that 


02 2k 
P(x) =5_ (2k - I! opr = exp(52) 
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It follows that 
L 
(14.16) (P -S)(x) = exp(a# + 5t 


Let us now try to find the same result using a recursion relation. 
We denote the structure P-S by B. In the set {1,2,...,n} we can 
either let n be a singleton or make a pair {z,n} with 1 <a <n-l. 
So 

bp = bn—-1 + (n — 1)bp—2 (n > 1); 


It follows that 
Bix) = (1+ 2)B(a), 


and since B(0) = 1, we again find (14.16) as solution. 
We return to the recurrence of Example 14.11 in Example 14.15. 


Problem 14D. Consider the structures Mop:= mapping of a set 
to itself with no fixed point, M,:= mapping of a set to itself with 
exactly one fixed point, and A:= arborescence. Find a relation 
between Mo(x), M(x), and A(x) and check the first few terms to 
see if the result is correct. 


We make things slightly more difficult. We wish to partition an 
n-set into parts and then impose structure N on each of the parts, 
where nop = 0. We claim that the exponential generating func- 
tion for the compound structure is exp(N(x)). A simple example 
is the case where N is the uniform structure (with the convention 
no = 0) and hence N(x) = e* — 1. The compound structure is of 
course ‘partition’ for which we know from Theorem 13.6 that the 
exponential generating function is equal to 


= = ax” ae (sees 
ye (> S(n, 0) = Ss ew = exp(e* — 1) = exp(N(z)). 


n=0 \k=0 k=0 
We formulate this method as a theorem. 


Theorem 14.2. If the compound structure S(N) is obtained by 
splitting a set into parts, each of which gets a structure of type N, 
then 

S(N)(x) = exp(N(«)). 


144 A Course in Combinatorics 


ProorF: By a slight generalization of (13.3) we see that if the par- 
tition of the n-set consists of b; parts of size 1, b parts of size 2, 
..., bg parts of size k, where 6; + 2b2+---+kb, =n, then there are 


Gi) aa 


ways to make the compound structure, and this has to be divided 
by n! to obtain the contribution to the coefficient of x” in the 
exponential generating function. If b}+---+b, = m, then this same 
contribution is found in exp(V(x)) from the term (N(x))™/ml, 


namely as 
1 m x\*1 ak \" 
m! \by,..., bp (m7) ee 


This proves the assertion. 


In fact, it is not difficult to see that this theorem is a special 
case of a more general method. Interpret the previous situation 
as follows. We have a uniform structure on a k-set of points and 
we replace each point by some structure of type N. The resulting 
configuration is an element of the compound structure that we just 
discussed. If the first of the structures is not uniform but, say, of 
type R, then the exponential generating function for the compound 
structure will be R(.N(x)) by the same argument as we used above. 
We sometimes call this procedure the substitution of N into R, 
and it is nice that we then must do the same for the generating 
functions! 


Example 14.12. If we substitute the structure ‘oriented cycle’ 
into the uniform structure, then we are considering the compound 
structure consisting of a partition of an n-set into oriented cycles, 
i.e. the structure II with as usual 7 = 1. So we must have II(x) = 
exp(C(z)) and, indeed, II(z) = (1 — x)~! and C(x) = —log(1— 72). 


Example 14.13. Let us look at Fig. 2.2 again, adding loops at 1 
and 21. We then have a description of a mapping of an n-set (with 
n = 21 in this case) to itself. But we can also interpret the figure as 
an element of the structure II, namely (1)(4, 5, 3)(7)(20, 12)(21) in 
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which each point has been replaced by an arborescence with that 
point as root. Letting A denote arborescence, we find from Cayley’s 
theorem, Theorem 2.1, that 


_ gute 
(14.17) A(x) = Son ae 


n=1 


(The extra factor n comes from the choice of the root.) Since there 
are n” mappings from an n-set to itself, the method of Theorem 
14.2 shows that the following relation must hold: 


Co 


(14.18) TI(A(x)) = aw = 


n=0 
We shall verify this after the next example. 


The following well known result from complex analysis often 
plays a role in combinatorial problems involving generating func- 
tions. It is known as the Lagrange inversion formula. It may be 
found in textbooks on analysis with analytic proofs; see the notes. 
It is possible to prove the theorem within the theory of formal power 
series (using formal derivation, etc.) and we give such a formal proof 
in Appendix 2. 


Theorem 14.3. Let f be analytic in a neighborhood of z = 0 and 
f(0) 40. Then, if w = z/f(z), z can be expressed as a power series 
z=) %, chw" with a positive radius of convergence, in which 


k-1 
(14.19) =a (=) ren] 


Example 14.14. We shall find a fourth proof of Theorem 2.1. Let 
T be the structure ‘labeled tree’. We wish to show that t, = n”~? 
for n > 1. If A denotes arborescences as before, then obviously 
An = Nty, ie. A(x) = xT’(x). Furthermore, from Theorem 14.2 
we see that exp(A(z)) is the exponential generating function for 
the structure ‘rooted forest’? (= F’). Consider a labeled tree on 
n+ 1 vertices as an arborescence with vertex n + 1 as its root and 


z=0 


146 A Course in Combinatorics 


then delete the root and all incident edges. The result is a rooted 
forest on n vertices. Since we can reverse this process, we have a 
one-to-one correspondence. Therefore 


: oo gn oo gr 7 
(14.20) eM) =14+ 0 fr =) tn = Tn) = 2 Ala), 
n=1 n=0 
1.e. 
A(z) 
(14.21) “aay = 


We apply Theorem 14.3 to (14.21) with z = A(x), f(z) = e? = 
eA), w= a. We find A(x) = 772, cex* with 


k-1 k-1 
k! dz = k! 


and it follows that t, = n”~?. Furthermore we see that the number 
of labeled rooted forests on n vertices is (n + 1)""1. 


Remark. We now have verification of (14.18) because (14.20) im- 
plies that A’(x) = e4@) + reA@ A'(z), ie. 


— a” eA) A(x) 
ih aiilee — A’ — — ‘¢ 
> a ae (2) 1—zxeA®) 1- A(z) 


Remark. The procedure of removing a point from a combinatorial 
structure with n+1 points that we used in Example 14.14 is called 
derivation and it indeed corresponds to the derivation of generating 
functions. Another example is removing a vertex from an oriented 
cycle on n + 1 vertices. This yields an oriented path on n vertices, 
of which there are n!. So their exponential generating function is 
(1 —)~! and that is C’(z). 


Problem 14E. Let a, denote the number of ways of decomposing 
a convex n+ 1-gon into quadrilaterals by inserting a number of 
nonintersecting chords. By convention ap = 0, a; = 1. Show that 
ktlim=n UkAldm = An for n > 3. If f(x) is the ordinary generating 
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function for the sequence a,, then find a functional equation for 
f(x) and solve this equation using Theorem 14.3. (We remark that 
the result that is obtained can also be proved combinatorially. ) 


Although we have stressed that in most cases the power series 
can be considered as formal power series, it has been clear in a 
number of examples that analysis can be an important tool in many 
combinatorial problems. We give one more example that shows a 
method that can be used for many recurrences. 


Example 14.15. In Example 14.11 we encountered the recursion 
(14.22) An = An—-1 + (n — 1)an-2 


and the corresponding exponential generating function. In previous 
examples we showed that the counting function grows more or less 
as c” for some constant c. Clearly a, grows more rapidly, but what 
is a good approximation for its asymptotic behavior? The first step 
in obtaining the answer to this question involves a method that is 
applicable to many recurrences. In (14.22) we substitute 


(14.23) an = f veer ae 
C 


where C’ is a path in C that we can still choose and w is a function 
also still to be determined. We substitute (14.23) into (14.22). 
The term (n — 1)an_» yields [, ¥(z)(n — 1)z""7 dz, and we require 
that integration by parts yields only the integral involving w'(z), 
the other term being 0 by a suitable choice of C. Then (14.22) 
becomes 


(14.24) [ {o(z)[z" — 2] + p'(z)z""} dz =0, 


which is true for all n € N if b(z)(1—z) = W"(z), ie. Y(z) = ae? 2”. 
Once we know w, the requirement on C’ shows that the real axis 
from —oo to oo is a good choice for the path of integration. From 
ag = 1 we find that a = (2me)~?. 

To find the asymptotic behavior of a,, we must now analyze the 
behavior of 


(14.25) eo eh 2” x” dex. 
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Since the integrand is maximal near x = \/n, we substitute 7 = 
y+ /n and find 

(14.26) 


T= er 2amtvigan eyo ay? exp (- yVn + nlog(1 + +) dy. 


Now note that if u and v are negative, then |e“ — e’| < |w—v| and 


use the representation log(1 + t) = oo =3> = + fe is ds to 
obtain yl 
y Y y 
—yVn+ nlog(1+ + ss ; 


Substitution in (14.26) shows that the integral tends to 


is ey * dy = mel. 


We have proved that 


1 
e 4 
14.27 An © name artvn 


It is clear that this result cannot be found in an easy way! 


Problem 14F. Let F,,(x) denote the expansion of (1 — 2”)~#(™)/" 
in a power series. Also consider the expansion of e* as a formal 
power series. Prove that 


— II F(a) 
n=l 


is true as a relation between formal power series. 


Problem 14G. Find the exponential generating function for the 
number of symmetric n x n permutation matrices. 


Problem 14H. On a circle we place n symbols 0 and n symbols 
1 in an arbitrary order. Show that it is possible to number the 
positions on the circle consecutively from 1 to 2n such that the 
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sequence satisfies (14.13) if we replace the strict inequality ‘<’ with 
be, 


Problem 14I. Let the points 1,...,2n be on a circle (consecu- 
tively). We wish to join them in pairs by n nonintersecting chords. 
In how many ways can this be done? 


Problem 14J. Find the exponential generating function for the 
number of labeled regular graphs of valency 2, with n vertices. 


Problem 14K. Consider walks in the X-Y plane where each step 
is R: (x,y) > («+1,y) or U: (2, y) — (a, y+1). We start at (0,0) 
and ask in how many ways we can reach (2n,2n) without passing 
through one of the points (2i — 1,221) ,i=1,...,n. Prove that 
this number is the Catalan number wu2),+1. 


Problem 14L. Consider walks in the X-Y plane where each step 
is R: (ay) — (a+ 1,y) or U.: (a,y) — (2,y 4+ a), withaa 
positive integer. There are five walks that contain a point on the 
line x+y = 2, namely: RR, RU), U;R, U,U;, and Uy. Let a, denote 
the number of walks that contain a point on the line x + y = n (so 
a2 = 5). Show that a, = Foy, where F;, are the Fibonacci numbers 
starting with Fo = F, = 1. 


Problem 14M. Count pairs (f,g) of functions f, g, where f maps 
{1,2,...,r} to {1,2,...,n} and g is a permutation of {1,2,...,n} 
that fixes the image of f pointwise. Doing this in two ways prove 


that forn >r 
S- (;) ki dink — B(r) : ni, 


k=1 


where d,, denotes the number of derangements of 1,2,...,m and 
B(r) is a Bell number. 


Problem 14N. Consider walks in the X-Y plane where each step 
is R :(a,y) > (@ + 1,y), U s(a,y) > (@,y +1), or D: (2, y) > 
(c+1,y+1). We wish to count walks from (0,0) to (n,n) that are 
below or on the line x = y. Find a recurrence for the number of 
walks like (14.10). 
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Notes. 

For an extensive treatment of the material in this chapter, we 
refer to Goulden and Jackson (1983) and Stanley (1986). A very 
readable survey is Stanley (1978). 

The first of the references mentioned above has much on the the- 
ory of formal power series. For this topic also see Niven (1969) and 
Appendix 2. Generally speaking, operations with power series are 
‘formal’ and require no discussion of convergence when the coeffi- 
cient of any monomial x”, say, in a sum or product is determined 
by a finite number of operations (i.e. no limits). For example, 
yo o(5 + 2)” is not allowed in formal operations since even the 
constant term is not a finite sum; but )>* 9(nla + x*)" is allowed 
(even though it does not converge for x 4 0) since the coefficient 
of any power of x requires only a finite computation (cf. Problem 
14F). 

The first linear recurrence that one usually learns to solve is 
An41 = An + Gn_1. It leads to the famous Fibonacci sequence; cf. 
Problem 14A. As we observed in Chapter 10, this name is due to 
Lucas. The sequence is related to a problem occurring in the book 
Liber abaci (1203) by Leonardo of Pisa (known as Fibonacci). 

The number a, of polyominoes of size n, treated in Example 
14.5, was shown to be less than c” (where we used c = 15). Note 
that this assertion also follows from Fekete’s lemma. We derived 
the inequality by using a suitable coding for the cells. The reader 
may wish to try to prove that a(m,n) < m-c" for a suitable 
c by using (14.3) and induction. The remark following (14.9) is 
based on a statement in Stanley (1986), attributing an unpublished 
combinatorial proof to D. Hickerson. 

The result of Example 14.6 is related to the fact that F,» has 
as subfields the fields F,,a, where d divides n. From this, one finds 
that x?” — x is the product of all the irreducible polynomials over 
F, with a degree d dividing n. This yields the formula proved in 
the example. 


Despite the fact that there is extensive literature on all the prob- 
lems with the Catalan numbers as solution, the problems reappear 
regularly (e.g. in problem sections of journals). As stated earlier, 
the Belgian mathematician E. Catalan (1814-1894) studied well- 
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formed bracketings (as in Example 14.7). 

The reflection principle of Fig. 14.2 was used by the French com- 
binatorialist D. André (1840-1917) in his solution of Bertrand’s 
famous ballot problem: if, at the end of an election, candidate P 
has p votes and @ has q votes, p < q, then the probability that Q 
was ahead all the time during the election is (q — p)/(q +p). 

The Lagrange inversion formula is one of the many contributions 
to analysis by J. L. Lagrange (see notes to Chapter 19). For a 
proof see §7.32 in Whittaker and Watson (1927). The theorem was 
published in 1770. We also refer the reader to G. N. Raney (1960). 
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Partitions 


We have considered several partition problems in the previous chap- 
ters. We now come to the most difficult one, namely the problem 
of unordered partitions of n into k parts, respectively any number 
of parts. We define p;,(n) as the number of solutions of 


(15.1) nNm=4y+Ho+---+ 2¢, CS ee Ss > ae > 1. 


For example, 7=5+1+1=4+4+24+1=3+4341=3+42+42,s0 
p3(7) = 4. 

Using the same idea as in the Corollary to Theorem 13.1, we see 
that p;,(m) equals the number of solutions of n — k = yy +--++ yp 
with y; >--- > y, > 0. If exactly s of the integers y; are positive, 
then by (15.1) there are p,(n — k) solutions (yi,...,y,). Therefore 


(15.2) px(n) = )_ p(n — ki). 


Problem 15A. Show that pz(n) = pr-i(n — 1) + p(n — k) and 
use this to prove (15.2). 


Since we have the trivial initial conditions p;,(n) = 0 forn < k 
and p,(k) = 1, we can recursively calculate the numbers p;(n). 
Clearly pi(n) = 1 and po(n) = |n/2]. 


Example 15.1. We shall show that p3(n) = {5}, i.e. the number 
nearest to - using a method that can be used for other values of 
k. Let a3(n) denote the number of solutions of n = 2, + 179+ 23, 
Ly, > X2 > x3 > 0. Then az(n) = p3(n + 3) and writing y3 = 23, 
y2 = £2 — 2X3, Yi = L1 — Lo, we see that a3(n) is the number of 
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solutions of n = y; + 2y2 + 3y3, y; > 0,7 = 1,2,3. Therefore (cf. 
Example 14.2) 


(15.3) Yast hes tes) deer. 


Let w = e?"'/3, The partial fraction decomposition of (15.3) yields 


1 Le 


1 3 -2 = 4 
(15.4) ) Lavin eis a tle) + 7-2) + 75 (1 — 2) 
1 1 1 
+ s(l+2)' + 5d -wa) + o(l-w's)™. 


Using (10.6), we find from (15.4) 


1 rs (—1)" 1 2n 
ere) = 75 + 3 + 9(u" +0"), 


a3(n) = 


and this implies that 


Therefore 
(15.5) p3(n) = {n’}. 


Problem 15B. We choose three vertices of a regular n-gon and 
consider the triangle they form. Prove directly that there are 
{n*} mutually incongruent triangles that can be formed in this 
way, thereby providing a second proof of (15.5). 


We shall now show that (15.5) is an example of a more general 
result. 


Theorem 15.1. Ifk is fixed then 
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Proor: (i) If n = a1 +---+ 4%, 41 > ++: > xy > 1, then the 
k! permutations of (1,...,2,) yield solutions of (13.2) in positive 
integers, not necessarily all different. So 


(15.6) k! p(n) > i ‘ i) 


Ifm =a, +--+ +44, 01 > +++ > ay > 1, then if y; := x; + (k — 1), 
1<i<k, the integers y; are distinct and yj +---+y,z = n+ Mee. 


Therefore 


Kk-1) 4 
(15.0) k! p(n) < (" i” k eS i ). 


The result follows from (15.6) and (15.7). 


Problem 15C. Let ai,a2,...,a; be positive integers (not neces- 
sarily distinct) with greatest common divisor 1. Let f(n) denote 
the number of solutions of 


nN = a,x, + ag%2 +--+ + Atxt 


in nonnegative integers 71,...,2;. What is the generating function 
F(x) := ¥o f(n)x"? Show that f(n) ~ cn‘! for some constant c 
and explicitly give c as a function of a1,..., a. 


The problem that we considered in the Corollary to Theorem 
13.1 concerns what is called the number of compositions of n into 
k parts. By this corollary the total number of compositions of n 
equals )y_, (7 }) = 2”-1. The method that we used in Theorem 
13.1 shows this directly as follows: consider n blue balls with a 
space between any two of them in which we can place a red ball 
if we wish. The total number of configurations that can be made 
in this way is clearly 2”~' and they represent all compositions of 
n. We could have proved the same result without our knowledge of 
Chapter 13 as follows. Let c,, denote the number of compositions 
of n into k parts. Define 


CO 
Ca) = Ss" On nae 
n=k 
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Then, using a method that we have used several times before, we 
find 


cr (x) = erattatt--Paa-ayt= (Foo) e" 


n=k 


and 


lee) lee) 7 i 
dee) = Dpa(1-a2)* = 7 er 
k=1 k=1 


i.e. n has 2”-! compositions. 


Problem 15D. Show that in a list of all 2"~! compositions of n, 
n > 4, the integer 3 occurs exactly n+ 2”~° times. 


The most difficult and mathematically the most interesting and 
important function of all in this area of combinatorics is defined by 


(15.8) p(n) := number of unordered partitions of n. 


For example, p(5) = 7, the partitions of 5 being 


1+1+14+141 = 24+14+141 = 34141 = 24-241 = 441 = 342 =5. 
Clearly, p(n) is the number of solutions of 
(15.9) NM=et+tot::'+2y (ay: Ste > ac 0). 


As we saw above, this can also be formulated as the number of 
solutions of n = y,+2y2+---+nYyn, where y; >0,1<i<n. (Here 
y; denotes the number of terms x, equal to 7 in the first definition.) 

The study of the so-called partition function p(n) has led to some 
of the most fascinating areas of analytic number theory, analysis, 
algebraic geometry, etc. We shall obviously only be able to look at 
properties that are proved by combinatorial arguments. 


Theorem 15.2. The generating function for the partition function 
is 
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PROOF: We again use the idea of Example 14.2. The partition of 
n=)>", ir; with r; terms equal to i (for 1 < i < m) corresponds 
to the term xj'--- az" in the (formal) infinite product [[(1 + x + 


zz +---). Replacing x, by x", we find the result. 


Many theorems about partitions can be proved easily by repre- 
senting each partition by a diagram of dots, known as a Ferrers 
diagram. Here we represent each term of the partition by a row 
of dots, the terms in descending order, with the largest at the top. 
Sometimes it is more convenient to use squares instead of dots (in 
this case the diagram is called a Young diagram by some authors 
but we shall use Ferrers diagram as the name for the figure). For 
example, the partition (5,4,2,1) of 12 is represented by each of the 
diagrams of Fig. 15.1. 


Figure 15.1 


The partition we get by reading the Ferrers diagram by columns 
instead of rows is called the conjugate of the original partition. So 
the conjugate of 12 =5+4+2+4+1is12=4+3+4+2+4+2+41. The 
relationship is symmetric. We first show a few easy examples of 
the use of a Ferrers diagram. 


Theorem 15.3. The number of partitions of n into parts, the 
largest of which is k, is p,(n). 


PROOF: For each partition for which the largest part is k, the 
conjugate partition has & parts (and vice versa). 


Problem 15E. Show with a Ferrers diagram that the number of 
partitions of n +k into k parts equals the number of partitions of 
n into at most k parts. (This is (15.2).) 


In some cases a proof using generating functions requires less 
ingenuity than using a Ferrers diagram would. The next theorem 
can be proved with a diagram similar to a Ferrers diagram though 
we shall use generating functions instead. 
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Theorem 15.4. The number of partitions of n into odd parts 
equals the number of partitions of n into unequal parts. 


PrRooF: The generating function for the number of partitions of n 
into odd parts is (by an obvious generalization of Theorem 15.2) 
equal to [[>-_,(1 — 27”~')~', and the generating function for the 
number of partitions of n into unequal parts is (by the same argu- 
ment as used in the proof of Theorem 15.2) [][~.,(1 +2"). Since 


5 k _ py (l= 2”) 
ie =a 


(Q-2)[Ja-e/)*=]Ja-2)7, 


k= l=1 m=1 


rae 


the proof is complete. 


Now consider the function (P(x))~! = []7_,(1 — 2"). In the ex- 
pansion of this product, a partition of n into unequal parts con- 
tributes +1 to the coefficient of x” if the number of parts is even, 
and —1 if the number of parts is odd. So the coefficient of x” is 
Pe(n) — po(n), where p-(n), respectively p,(n), denotes the number 
of partitions of n into an even, respectively odd, number of unequal 
parts. It was shown by Euler that p.(n) = p,(n) unless n has the 
form n = w(m) := (8m? — m)/2 or n = w(—m) = (3m? + m)/2, 
in which case p.(n) — p(n) = (—1)". The numbers w(m) and 
w(—m) are called pentagonal numbers. This name is explained by 
the relation w(m) = 31") (3k + 1) and Fig. 15.2. 


sy @ OP 


1 144=5 14447=12 14447410=22 
Figure 15.2 


The following extremely elegant pictorial proof of Euler’s identity 
is due to Franklin (1881). 


158 A Course in Combinatorics 
Theorem 15.5. We have 
[[a = a") Se S$ >(-1)™ (oan a aeoi) : 
r=] m=1 


PROOF: Consider a Ferrers diagram of a partition of n into unequal 
parts as for 23 = 7+6+5+4342 in Fig. 15.3. 


i 


Figure 15.3 


The last row is called the base of the diagram. The number of 
dots in the base is denoted by b. In Fig. 15.3 the base is indicated by 
a line segment with b = 2 dots. The longest 45° line segment joining 
the last point in the top row with other points of the diagram is 
called the slope of the diagram. Its number of points is denoted 
by s. In Fig. 15.3 the slope is indicated by a segment with s = 3 
points. We now define two operations on the diagram, which we 
call A and B. 

A: If 6 < s, then remove the base and adjoin it to the diagram 
at the right to form a new slope parallel to the original one, unless 
b = s and the base and the slope have a point in common. This 
exception is considered below. 

B: If b > s, remove the slope and adjoin it at the bottom of the 
diagram as a new base, except if b = s+ 1 and the base and the 
slope have a point in common. 

In Fig. 15.3 operation A can be carried out and it yields the 
partition 23 = 8+7+5+3. An example of the exceptional situation 
for operation B is shown in Fig. 15.4. 


Figure 15.4 
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The exception for operation A can occur only if 
n=b+(b+1)+---+(2b-1), 


i.e. n = w(b). The exception for operation B occurs if n = (s + 
1) +--+ + 2s, ie. nm = w(—s). In all other cases exactly one of 
the operations can be carried out and we thus have a one-to-one 
correspondence between partitions of n into an even number of 
unequal parts and partitions of n into an odd number of unequal 
parts; and hence for these values of n, we have p.(n) — po(n) = 0. 
In the exceptional cases the difference is +1 or —1. 


Euler used Theorem 15.5 to find the following recursion formula 
for p(n). 


Theorem 15.6. Let p(n) = 0 forn <0. Then for n > 1, 


Co 


(15.10) p(n) = So(-1)"*" {p(n — w(m)) + p(n — w(—m))}. 


m=1 


PROOF: This is an immediate consequence of Theorem 15.2 and 
Theorem 15.5. 


Note that the sum in (15.10) is finite. With this recursion one 
can rapidly generate a table for p(n) for small values of n. 

The first two terms of the recursion formula are the same as in 
the famous Fibonacci recursion (cf. Problem 5E). Although these 
are followed by two negative terms, one might think that p(n) will 
increase in a way similar to the Fibonacci numbers, i.e. as c” for 
some constant c. This is not the case. We shall show that p(n) 
grows much more slowly. The actual asymptotics of p(n) involve 
complicated methods from analytic number theory. We mention 
only the main term of the asymptotic formula: 


1 Z 
lim n-? log p(n) = ny? 


One can show that in fact p(n) is much smaller than exp (x an). 


160 A Course in Combinatorics 


Theorem 15.7. For n > 2 we have 


p(n) < —_* et V5", 
6(n — 1) 


Proor: Let f(t) := log P(t). From Theorem 15.2 we find 


ft) =— os -t) = DY =e, 
k=1 k=1 j=l j=l 


From now on let 0 < t < 1. Then from 
(1-24) -#)=14+t+-.-+0 1 > je, 


we find 


Since p(n) is increasing, we have P(t) > p(n)t"(1 — t)~+. By com- 
bining the two inequalities and then substituting ¢ = (1 + u)~!, we 
find 


log p(n) < f(t) — nlogt + log(1 — t) 
™ 


t 
2 
= ut tnlog(1 + wu) + log 


Therefore 


1 
log p(n) < gnu +(n—1)u+ logu. 


—i 
2, 


We get the required inequality by substituting u = 7{6(n — 1)} 


Euler’s identity is a special case of a famous identity of Jacobi 
that is important in the theory of theta functions. We would not 
mention this so-called Jacobi triple product identity if we had not 
known of the beautiful combinatorial proof using a Ferrers diagram, 
due to Wright (1965). 
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Theorem 15.8. We have 


[[a _ aye 4 Ge OG oe as ee = S- q’ t". 
n=1 T=—0O0 


PROOF: We rewrite the assertion as 


n— n—14— rar n\ — 
[[Ja@+er nase te) = SO ge [Ja-7)4, 
n=1 T=—0o n=] 


and substitute « = qt, y = qt~!. This yields the relation 
(15. 11) Lt l+a"y n-1 \1+arty")} 


eo) 
1 
aS e232" (r+ ay ar( he Pal y") 


r>=—OoOo 


We shall prove this relation by interpreting both sides as generating 
functions corresponding to the counting of appropriate combinato- 
rial objects, and then showing a one-to-one mapping between these 
objects. For the left-hand side, we find as a combinatorial interpre- 
tation the generating function for the number of partitions of the 
Gaussian integer n + mi into parts a + (a — 1)i and (b — 1) + bi 
(a > 1,6 > 1), with no two parts equal. Call this number a(n, m). 

So the left-hand side of (15.11) is 30°, 07, a(n,m)a"y™. On 
the right-hand side, we use Theorem 15.2 and replace the hee 
by 322. p(k)z*y*. We must therefore prove that a(n,m) = p(k), 
where n =k + $r(r +1) and m =k + $r(r — 1). Without loss of 
generality, we may assume that n > m, i.e. r > 0. A partition of 
n+mi must have v > 0 terms of type (b— 1) + bi and v+r terms 
of type a+ (a — 1)i, and therefore n > $r(r +1),sok>0. In Fig. 
15.5 below, we consider the example (n,m) = (47,44), so r = 3 
and k = 41. We have taken the Ferrers diagram of the partition 
41 = 124+10+8+5+2+4+2-+ 2 and above the top row we have 
added rows of length r,r —1,...,1. 
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N 


LZ 


Figure 15.5 


The resulting diagram is split into two parts. The shaded part on 
the left is determined by the ‘staircase’ starting with the part that 
was added at the top. We read the shaded part by columns. By the 
construction, the sequence is decreasing. These numbers are the a’s 
for terms of type a+ (a —1)i. The unshaded part is read by rows 
and again the numbers are decreasing. These are the 6 — 1’s for 
terms of type (b—1)+0i (the last b could be 1, corresponding to one 
empty unshaded row). The number of a’s exceeds the number of 
b—1’s by r and the sum of the a’s and b— 1’s is clearly k+ $r(r +1). 
So we have indeed produced a partition of n + mi of the required 
type. It is easy to see that the procedure can be reversed, i.e. we 
have defined the one-to-one mapping we were looking for. 


Problem 15F. Prove that the number of self-conjugate partitions 
of n equals the number of partitions of n into unequal odd parts. 


As a final problem related to partitions we shall consider objects 
known as Young tableaux or standard tableaux. A Young tableau of 
shape (1, 72,...,%m) is a Ferrers diagram (or Young diagram) of 
squares in which the integers 1 to n have been inserted (one in each 
square) in such a way that all rows and columns are increasing. For 
example, in Fig. 15.6 we display a Young tableau of shape (5,4,2,1). 
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3/4] 7/11 
10} 12 


OU 


COlD!)NmM] FR 
oOo 


Figure 15.6 


We are interested in determining the number of Young tableaux 
of a given shape. At first, this may seem like a somewhat unnatural 
problem. However, Young tableaux play an important role in the 
theory of group representations (and other areas). One of the inter- 
esting facts concerning these tableaux is that there is a one-to-one 
correspondence between the Young tableaux with n squares (that 
we shall call ‘cells’) and involutions of 1 to n, where we include 
the identity as an involution. (Note that we can therefore count 
the total number of tableaux using Problem 14G.) For a treatment 
of this one-to-one correspondence and several related problems we 
refer to D. Knuth (1973). 

In order to count the Young tableaux of a given shape, we need 
to introduce the function A(x ,...,2,) defined by 


(15.12) Nie dee. | eae), 


1<i<jg<m 


(Note that A is the value of the Vandermonde determinant.) 
Lemma 15.9. Let 


Digest TP) a A hs a ct 
+ 29 A(a1, 22+ Y,---,Lm) +°°+ + Lm Ala, Lo,.--,2m ty). 


Then 
m 
Otis) = (x +---+am+ ("))») A(Gis x23 a): 


PRooF: Clearly the function g is a homogeneous polynomial of 
degree 1 + deg A(a1,...,%m) in the variables 21,...,%m,y. If we 
interchange x; and x;, then g changes sign. So if 7; = xj, then 
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g must be 0, i.e. g is divisible by x; — x; and hence by the poly- 
nomial A(x1,...,%m). If y = 0, the assertion is obvious. There- 
fore we only have to prove the coefficient (3) for y. If we ex- 
pand g, then the terms of degree 1 in y are “4 5A(a, 25M) atid 


—“i¥ A(a1,...,2m) for all pairs (i, 7) with 1 < i 2 j <m. The sum 


Uji-L;j 


of these terms is clearly CAG bali e 


We introduce a function f defined on all m-tuples (11,...,m), 
m > 1, with the following properties: 


(15.13) 
FE Wiehe Stl) =U) unless nj > ng >--- > 0; 
(15.14) 
ties 0) SF ee a 
(15.15) 


f(nm1,---;%m) = f(m1 — 1, 12,---,%m)+ 
f(mi, ne —1,...,%m) +--+ + f(r, na, -.-,%m — 1), 
if ny > ng >--- > Nm > O; 
(15.16) f(n)=1  ifn>0. 


Clearly f is well defined. We claim that f(mi,...,m%m) counts the 
number of Young tableaux of shape (m1,...,%m). Condition (15.13) 
is obvious and so are (15.14) and (15.16). To see that the number of 
Young tableaux of shape (n1,...,%m) satisfies (15.15), we consider 
the entry n. It must be the last entry in one of the rows, and if 
we remove the square with n, then we obtain a Young tableau for 
n—1. In fact, if two or more rows of the tableau have the same 
length, then n can only be the last entry in the lowest of these; but 
including all terms in (15.15) does not matter since if, say, n1 = no, 
then f(n1 —1,1n2,...,%m) = 0. 


Theorem 15.10. The number of Young tableaux that have shape 

(n1,.-.,%m) satisfies 

A(ni +m—1,n2+m—2,..., Mm) n! 
(ny +m—1)!(ng +m — Dr. erie! 


(15.17) f(m1,...,%m) = 


v] 


and in fact this formula for f is correct ifny+m—1>n2e+tm—2> 
. = Nm: 
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PROOF: We first observe that, if for some i we have nj +m —1 = 
nisi + m—i-—1, then the expression A on the right-hand side 
of (15.17) is 0, in accordance with the fact that the shape is not 
allowed. We must show that the right-hand side of (15.17) satisfies 
(15.13) to (15.16). All but (15.15) are trivial. We find the relation 
(15.15) by substituting 7; = n;+m-—i and y = —1 in Lemma 15.9. 


There is a formulation of this theorem that makes it more in- 
teresting. We introduce the concept of ‘hook’ in a Young tableau. 
The hook corresponding to a cell in a tableau is the union of the 
cell and all cells to the right in the same row and all cells below it 
in the same column. The ‘hook-length’is the number of cells in the 
hook. In Fig. 15.7 we show a Young diagram with in each cell a 
number equal to the length of the corresponding hook. The shaded 
area is the hook corresponding to the cell in row 2 and column 3. 


12)11)9/}7)5)2/1 
9 | 8 [642 e 
8|7(e)3/1\e 
6) 573] 1) ¢ 
4|3/1,¢0 
2/lje 

Figure 15.7 


We now have the following remarkable theorem due to J. S. 
Frame, G. de Beauregard Robinson and R. M. Thrall (1954). 


Theorem 15.11. The number of Young tableaux of a given shape 
and a total of n cells is n! divided by the product of all hook-lengths. 


ProoF: If we have a Young tableau of shape (1,...,%m), then the 
hook-length for the cell in row 1 and column 1 isn,+m-—1. Consider 
the hook-lengths for the cells in row 1. In Fig. 15.7 the second 
column has the same length as column 1 and hence the hook-length 
for the next cell is ny +m — 2. For the next cell the length drops 
by 2 because of the ‘missing cell’ indicated by a dot in the last row. 
Similarly, there is a jump of 3 for the column with two dots. So in 
row | we find as hook-lengths the numbers from 1 to nj-+m—1 with 
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the exception of the numbers (n} ++m—1)—(nj+m—j),2 <j <m. 
Similarly row 7 contains the integers from 1 to n;-+m—i except for 
(nj +m—t)—(nj+m—Jj), i+1 <j <m. It follows that the product 
of the hook-lengths is {][/", (ni + m—i)!}/A(m1 +m—1,...,7m). 
By Lemma 15.9 we are done. 


Problem 15G. Consider a Young tableau of shape (n,n). Define 
a sequence az, 1 <k < 2n, by ay :=7 if k is in row 7 of the tableau, 
7 = 1,2. Use this to show that the number of Young tableaux of 
this shape is the Catalan number wu,,+, (in accordance with Theorem 
15.11). 


Problem 15H. How many Ferrers diagrams are there that fit in 
a box of size k x n—k? 


Problem 15I. Prove that the number of partitions of n into parts 
not divisible by d equals the number of partitions in which no part 
occurs more than d — 1 times. 


Problem 15J. Define P, := [];_,k!. Prove that P?_, - (n’)! is 
divisible by Po)_1. 


Notes. 

The use of Ferrers diagrams to prove theorems such as Theorem 
15.3 was introduced by Sylvester in 1853. He wrote that the proof 
had been communicated to him by N. M. Ferrers. 

For other proofs involving Ferrers diagrams and analogues (e.g. 
a proof of Theorem 15.4) we refer to Hardy and Wright (1954), 
MacMahon (1916), and Van Lint (1974). 

Euler’s proof of Theorem 15.5 was by induction. Polygonal num- 
bers occurred in ancient mathematics, e.g. triangular numbers were 
considered by Pythagoras before 500 BC. 

Theorem 15.7 is due to Van Lint (1974). 

The asymptotic behavior of the partition function p(n) was given 
by G. H. Hardy and S. Ramanujan (1918) in the paper in which 
they developed the famous ‘circle method’ that has had so many ap- 
plications in number theory. A further asymptotic result was given 
by H. Rademacher (1937). The proofs depend on the remarkable 
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functional equation for the famous Dedekind »-function 


oo 
ene) 


For an exposition of these proofs see Chandrasekharan (1970). 

C. G. J. Jacobi (1804-1851) is best known for developing the 
theory of elliptic functions. He became professor at Konigsberg at 
the age of 23. Two years later he published his book Fundamenta 
Nova Theoriae Functionum Ellipticarum. In 864 we find what is 
now known as the triple product identity. 

The first to use tableaux such as in Fig. 15.6 was Frobenius! A 
year later (1901) A. Young independently introduced them in his 
work on matrix representations of permutation groups. The name 
Young tableaux has become standard terminology. 
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16 
(0,1)-Matrices 


In the previous chapters (0,1)-matrices have turned up a number 
of times; and especially those with constant line-sums led to in- 
teresting problems. In this chapter we shall consider the existence 
problem for (0,1)-matrices with given line-sums, and try to count 
or estimate how many there are in the case of constant line-sums. 
In the first problem, the matrices are not necessarily square. If 
the row-sums of a matrix A are r1,72,...,7,%, then we shall call the 
vector r := (r1,12,---,Tk) the row-sum of A, and similarly for the 
column-sums. 

We consider the problem of the existence of a (0, 1)-matrix with 
given row-sum r and column-sum s. For convenience, we shall 
assume that the coordinates of r and s are nonincreasing; that is, 
r and s are partitions (we may allow trailing zero coordinates in 
partitions for the purposes of this chapter). 

Given two partitions r = (r1,7T2,...,Tn) and s = (81, S2,..., 5m) 
of the same integer N, we say that r majorizes s when 


Cio ose ea he Sep eae eee aE 


for all k, interpreting rz, or s; as 0 when k exceeds n or m, respec- 
tively. Recall that the conjugate of a partition r is the partition r* 
where r; is the number of 7 such that r; > 7. 


Problem 16A. Prove the following assertion: r majorizes s if and 
only if the latter can be obtained from the former by a sequence of 
operations ‘pick two parts (coordinates) a and b so that a > b+ 2 
and replace them by a— 1 and 6+ 1’. (That is, s is ‘more average’ 
than r.) For example, (5,4,1) majorizes (3,3,3,1) and the latter 
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can be obtained from the former in several ways with sequences of 
such operations, including 


GAL0) 6,9 9.0) 153.14) S499 4) 33.3) 


and 
(5A 1:0) Sa ey (49 9) 39534): 


Theorem 16.1. Let 71,79,...,1%, and $1, 82,...,8m be two nonin- 
creasing sequences of nonnegative integers each summing to a com- 
mon value N. There exists ann x m (0,1)-matrix with row-sum 
r and column-sum s if and only if r* majorizes s. 


PROOF: To prove necessity, let k be given and consider the first k 
columns of a hypothetical (0, 1)-matrix with row-sum r and column- 
sum s. The number of 1’s in these columns is 


n k 


6) go eee aS Y= min(k, 7) =i 


i=1 j=l 


The latter equality is most evident from the Ferrers diagram of the 
partition r (see Fig. 16.1) where we see both expressions count the 
number of cells in the first k columns. 


YUYYY, 


We now introduce a transportation network with a source S, 
a sink T,, one vertex x; for each row, and one vertex y; for each 
column. There is to be an edge of capacity r; directed from S' to 
xj, 1<i<n, an edge of capacity s; from y; to T, 1 <j <m, and 
an edge of capacity 1 from x; to yj, 1<i<n,l<j <m. We claim 
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that a (0, 1)-matrix M = (a;;) with row-sum r and column-sum s 
exists if and only if this network admits a flow of strength N (there 
are at least two cuts of capacity N, so this would be a maxflow). 
Given such a matrix, we get a flow of strength N by saturating the 
edges incident with S and T’ and assigning flow a;; to the edge from 
x; to y;. Conversely, if the network admits a flow of strength N, 
then there exists an integral flow of that strength (cf. Theorem 7.2); 
clearly, the edges incident with either S or TJ’ must be saturated, 
and on the other edges the flow must be either 0 or 1. 

Consider a cut A,B in this network which separates S and T. 
Say A consists of S, no vertices of X := {x;:1=1,...,n}, and mo 
vertices of Y := {y;: i= 1,...,m}. The edges crossing from A to 
B include n— no edges leaving S', mg edges into T’, and no(m— mo) 
edges from X to Y. The capacity of this cut is at least 


Trotl + Png +2 a +Tp, 7 Sm—motl a Sm—mo+2 at ease Sm + No (m a mo). 


Figure 16.2 


It is again convenient to refer to the Ferrers diagram of the parti- 
tion r (see Fig. 16.2). The number of cells in the Ferrers diagram is 
N. So clearly N is at most no(m— mo) plus the number of cells in 
the last n —no rows plus the number of cells in the last mo columns. 
The number of cells in the last mp columns is the sum of the last 
mo parts of the conjugate r*. But under the assumption that r* 
majorizes s, 


* * * 
Pm mot+l1 ae lm mMo+2 S gee ae Tm = Sm—mot+l oe Sm—mo+2 SRS 
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and we conclude that the capacity of any cut is at least N. By 
the maxflow-mincut theorem, Theorem 7.1, the required flow of 
strength N exists. 


We have given the above proof in order to establish the connec- 
tion between flows and (0,1)-matrices, but it is possible to give a 
more direct proof of sufficiency of the condition in Theorem 16.1. 
Our exposition is similar to that given by Krause (1996). 

First suppose we have a (0,1)-matrix A, with row-sum r and 
column-sum s,, and that s2 is obtained from s; by the operation 
of Problem 16A. Then we claim it is easy to construct a (0,1)- 
matrix Ay with row-sum r and column-sum sy»: If the parts a,b 
with a > b+2 in s; (that are replaced by a—1 and b+ 1 to obtain 
S.) are the sums of columns j and k, respectively, there must be a 
row 7 so that the (i,7)-entry in A, is 1 and the (7,/)-entry in A, 
is 0; if we interchange these two entries, we obtain a matrix A» as 
desired. 

Now assume r* majorizes s. Start with the matrix Ap correspond- 
ing to the Ferrers diagram of r, adding columns of 0’s if necessary 
to obtain size n x m. The column-sum of Ag is r*. If r* majorizes 
s, we can find a sequence of partitions 


* 
r =So,81,82,-.--,S¢=S 


so that each partition in the sequence is obtained from the pre- 
ceding one by the operation of Problem 16A. Then a (0, 1)-matrix 
A with row-sum r and column-sum s can, by the observation of 
the preceding paragraph, be obtained from Ap by a sequence of ¢ 
interchanges. 


Example 16.1. Take r = (3,2,2,2,1) and s = (3,3,3,1). Then 
r* = (5,4, 1), and this majorizes s. The algorithm described above 
might give rise to the following sequence of matrices: 


1110 1110 1110 1011 
1100 0110 0110 0110 
1100 —. 1100 —= 0110 — 0110 
1100 1100 1100 1100 


1000 1000 1000 1000 
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Here the column-sum vectors are 
(5,4, 1,0) — (4,4, 2,0) — (3,4, 3,0) — (3,3, 3, 1). 


(We do not care if the vectors are not always nonincreasing.) 


We extend the idea of the algorithm in the theorem below, and 
find a (rough) estimate for the number of (0, 1)-matrices with cer- 
tain line-sums as a consequence. 


Theorem 16.2. Given partitions r and s of an integer N, let 
M(r,s) denote the number of (0,1)-matrices A with row-sum r 
and column-sum s. If r majorizes ro and s majorizes So, then 


M(ro, So) > Mr, s). 


PROOF: We will establish the following simple observation. Let us 
fix a row-sum vector r = (r1,7T2,.--,1%) of length n and a column- 
sum vector s = (81, 59,...,m) of length m that have equal sums. 
(We do not require them to be nonincreasing.) If s; > so, then we 
claim that 


M(r, (sy 2 de, S92 ts ds S35+2+5 Sin) Ze Mr, (s1, $2, 53,+++, Sit): 


Of course, the same will hold for any two columns (we have used the 
first two only for notational convenience). The same idea applies 
to the transpose, where keeping s constant and replacing r; and 
r; (r~; > 73) by rj — 1 and r; + 1 will not decrease the number 
of associated (0,1)-matrices. The inequality of the theorem then 
follows from this observation and the result of Problem 16A. 

To prove the observation, consider (0,1)-matrices A of size n x 
(m — 2) with column-sum (53, $4,...,5m). For a given matrix A, 
it may or may not be possible to prepend two columns to A to 
get row-sum r; if it is possible, we need to add two 1’s to a rows, 
one 1 to b rows, and no 1’s to c rows, say, wherea+b+c=n 
and 2a+ b = s; + s9. But the number of ways to prepend the two 
columns to get new column-sums s; and 59 is 


Ga Cee 
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and this is at most the number 


bog) Gay) 


of ways to prepend the two columns to get new column-sums s — 1 
and s2 +1 since the latter sums are closer to b/2. 


The sufficiency of the condition of Theorem 16.1 is a quick corol- 
lary of Theorem 16.2. If r* majorizes s, then 


M(r,s) > M(r,r*) > 1. 
Problem 16B. Prove that M(r,r*) = 1. 


We will use A(n, k) to denote the number of n x n (0, 1)-matrices 
with all line-sums equal to k. 


Corollary. 

A(n,k) > (i) jae} 
Proor: The number of (0,1)-matrices with row-sum (k,k,...,k) 
is ‘oe Each has a column-sum s = (51, S82, 3,..., Sn) satisfying 


O0<s; <n, 5s; +---+ 8, = nk. Ignoring the restriction s; < n, 
we see that by Theorem 13.1 there are at most aa ) es) 
such column-sums. Since the greatest number of associated (0, 1)- 
matrices occurs for column-sum (k,k,...,), the number of such 
matrices is at least the average number. 


Problem 16C. Prove the following theorem. 


Theorem 16.3. Let d and d’ be two partitions of an (even) integer 
N. Ifd majorizes d’, then there are at least as many labeled simple 
graphs with degree sequence d’ as with degree sequence d. 


Problem 16D. If 7 is even, then 
2 


1 PAL 
Show this by modifying the proof of the Corollary to Theorem 16.2. 
(Substituting k = sn in the corollary would give a poor result.) 


Problem 16E. Let d = (dj, d2,...,d,) be a partition (possibly 
including 0’s) of (5). Prove that there exists a tournament on the 
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vertex set 1,2,...,7 (an orientation of K,,) so that vertex 7 has 
outdegree d; for all 2 if and only if d is majorized by 


Gea 5 D0): 


Problem 16F. (i) Given an integer m < (5); consider the graph 
with vertex set 1,2,...,n and whose edges are the first m of the 


2-subsets in lexicographical order: 


{1,2}, {1,3},..., {1,n}, {2, 3}, (2, 4},...{2,n}, {3,4},.... 


This graph has a certain degree sequence r, which is a partition of 
2m. For example, when n = 8 and m = 20, r = (7, 7,7, 5,4, 4, 3, 3). 
Given n, m < (5), and a partition d (possibly including 0’s) of 
2m, prove that there exists a simple graph on n vertices with de- 
gree sequence d if and only if d is majorized by the partition r as 
described above. 

(ii) Generalize for simple k-uniform hypergraphs (a set of points 
and a set of k-element subsets). 


As in Chapter 11, let A(n,2) denote the set of (0, 1)-matrices of 
size n with all line-sums equal to 2. A method that we learned in 
Chapter 14 allows us to count the number A(n, 2) of elements of 
A(n, 2). 

Let A(n,2)* denote the subset of A(n,2) consisting of indecom- 
posable matrices (cf. Chapter 11). Define a, := |A*(n,2)|. It is 
easy to see that a, = $n!(n — 1)! as follows. We have (5) choices 
for the first row. If we choose (1 1 0 ...0) as first row, then there 
is one more row (out of n — 1) with a 1 in the first column and a 
second 1 not in the second column (so n — 2 choices). This gives us 
(n — 1)(n — 2) possibilities, etc. 

Now define 


My = A(n, 2)/(n!)’, 
by. = a,/(k!)?. 
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Then by the same argument as was used in Theorem 14.2, we have 
(16.1) 1+ 5° mx” = exp (>: na) 

n=2 k=2 


Therefore we find 


(16.2) 1+ S>mnax" = exp (= 7 est = ) Se (Ieee. 


n=2 


It is an easy exercise to show that from the expansion 


we can conclude that 


a 2) ee 
Mn ~ e ( jer for n — oo. 
n 


We have thus proved the following theorem. 


Theorem 16.4. 
(2n)! 


(212 


This theorem is a special case of the following theorem (that we 
shall only prove for k = 3, using a method that can be applied for 
other values of k). 


Theorem 16.5. 


A(n,k) = ne exp _-— +0 ( 


uniformly ink for1 <k < logn. 


NiFR 


A(n,2) ~ e~ 


}b non, 


For more information on this question and generalizations to the 
case where the line-sums are not constant we refer to B. D. McKay 
(1984). 


~ 
etco] / 
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For the case k = 3, we now prove a result slightly stronger than 
obtained by setting k to 3 above. We shall use the truncated form of 
the principle of inclusion and exclusion (cf. Theorem 10.1, Remark). 

Consider N := 3n elements, numbered 


Tae 1;, le 2a) 2p, Qe see Mas Mp, Ne- 


Form a permutation of these N elements and subsequently form the 
corresponding ordered partition into triples: (x,y, z)(u,v,w)... . 
We shall say that a repetition occurs if, in at least one of these 
triples, some number occurs more than once, as in (54,39, 5-). As- 
sume that the chosen permutation results in a sequence of n 
triples with no repetition. In that case, we associate an n x n 
(0, 1)-matrix with the permutation as follows. If the i-th triple is 
(La, Ys, 2%), Where {a, 3, y} C {a,b,c}, then the matrix has ones in 
row 7 in the columns numbered wz, y, z, and zeros elsewhere. Since 
the indices a,b,c do not influence the matrix, and the order of the 
three elements within one triple is also irrelevant, each such matrix 
corresponds to (3!)?” different permutations. Clearly the matrix is 
in A(n,3). We must find an estimate for the number P of permu- 
tations with no repetition among all No := N! permutations of the 
3n elements. 

Let 1 <r <n. Specify r triples and count the permutations that 
have a repetition in these triples. Then sum this over all choices of 
the r triples. As in Theorem 10.1, we call this number N,.. Then if 
R is even, we have 


R+1 R 
(16.3) So(-1)'N, < P< S0(-1)'N,. 
r=0 r=0 


The difficult part of the proof is finding a suitable upper estimate 
and lower estimate for N,.. 

We start with the upper estimate. The r triples can be chosen 
in C) ways. Within each triple we specify two positions, where we 
require a repetition. This can be done in 3” ways. Now we choose 
r numbers that will occur in the repetitions, and the indices a, b,c. 
This can also be done in (") - 3" ways. Subsequently we distribute 
the chosen numbers over the chosen positions, and that can be 
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done in 2"-r! ways. Finally, we distribute the remaining N — 2r 
numbers arbitrarily. Clearly, a permutation with a repetition of 
type (5a, 5p, 5-) has been counted more than once. So we have 


(16.4) 
2 
N, < e 377.9" -rl. (N — 2r)! 
Tr 


r 


Qe 
< (3° - n*)"(N = 2r)!l < NIG ( 
Tr: 


N= oF\- 7 
N 


rie gee s/n. Here the last step is based on the power series expan- 
sion of (1 — )~?", and choosing r in such a way that each term is 
at most half of the previous term. 

For the lower estimate, we start in the same way. After the r pairs 
have been chosen and distributed over the chosen positions, we first 
complete the r triples that have a repetition, using elements with a 
number that differs from the two repeated numbers for each triple. 
This can clearly be done in more than (N — 3r)" ways. Then we 
distribute the remaining numbers. Some permutations that should 
be counted are not, i.e. 


(16.5) 
N, > ) 327.97 - rl. (N — 3r)"(N — 3r)! 
> NIIe wy nn -1). Ce ] ies 


(1-=) Si. oH Pep en. 
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We now combine (16.3), (16.4) and (16.5). We take R = |$/n]. 
We find 


P . 9R+1 3 oo 2" yr? 

—=e "+A h A| < ‘ 

mere See oI (RHI! nt! 
and hence 


48 18e? 200 
JA] << —+ < 
nm nm 


<— 
We have therefore found the following estimate (in accordance with 
Theorem 16.5). 
Theorem 16.6. We have 
(3n)! _, 1 
A(n, 3) = (ain 1+ Oo") (n — oo). 


Remark. Note that for k = 3, we find from the Corollary to The- 
orem 16.2 that A(n,3) grows at least as fast as (cn?)”, for a suitable 
constant c, and this differs from Theorem 16.6 only in the constant. 


Problem 16G. Determine A(5,3). 


We now describe another connection between (16.2) and Chapter 
14. Note that squaring the right-hand side of (16.2) yields the 
exponential generating function for the number of derangements 
as given in Example 14.10. This means that we should be able to 
show combinatorially that 


n > 
(16.6) nd, = S> (;) i 


k=0 
where we have used A,, for A(n, 2). 
As a preparation, consider an indecomposable component of an 
element of A(n,2), for example the following 5 x 5 submatrix for 
the case n = 9: 


13 4 5 9 
1 LA Oe O50 
2 dO a: Oe) 
3 0 1 0 0 1 
7 O)ooGe de 
9 Oe 902 20s" al 
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Starting with the two 1’s in the top row, there are two ways to 
describe the cyclic structure: 


(1,3), (3,9), (9,5), (5,4), (4,1) 
corresponding to the permutation (1 3 9 7 2) of the rows and 
(3,1), 1,4), (4,5), (5,9), (9,3) 


corresponding to the reversed permutation (1 2 7 9 3). 

The first case (1 < 3) we call the red variety, the second the blue 
variety. 

Consider the set S consisting of elements of A(n,2) with the 
1’s colored red or blue in such a way that in an indecomposable 
component all 1’s have the same color. Clearly 


n 2 
|= 30 (j) Avance 


k=0 


We define a one-to-one mapping from S to the set of pairs of per- 


mutations (a1, d2,...,@n), (b1, b2,...,b,) of 1,2,...,n in which the 
first is a derangement. There are n!d, such pairs. We do this by 
example. 

Let 


a i= (a1, de, _ ., ag) = (2,7, 1,8, 4, 5,9, 6, 3), 
i= (b1, bo, oe ., 09) _ (3, 1,4, 5, 9, 2, 6, 8, 1): 


The canonical cycle decomposition of a is (1 2 7 9 3)(4 8 6 5). 
We split b accordingly into (3 1 4 5 9) and (2 6 8 7). The first 
pair describes the 5 x 5 blue submatrix treated above; the second 
describes a red 4 x 4 submatrix. 

This establishes (16.6). 

The following problem gives one more relation with Chapter 14. 


Problem 16H. Again let A, := A(n,2). Consider a matrix in 
A(n, 2). There are 1) ways to choose the two 1’s in row 1. Suppose 
these are in the first two columns. Possibly some other row also has 
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two 1’s in the first two columns. Now, consider the other possibility 
and show that 

-—1 
(16.7) A, = sey +(n—1)Ap_2). 
Problem 16I. Let f(x) be the function defined by the left-hand 
side of (16.1). From (16.7) derive the differential equation 


2(1—x)f'-—af =0 


and give a second proof of (16.2). 


Notes. 

Theorem 16.1 is due to D. Gale (1957) and to H. J. Ryser (1957). 
For a more extensive treatment of the classes of (0,1)-matrices that 
we have considered, we refer to Ryser (1963). 

The relation between (16.2) and the generating function for de- 
rangements was pointed out to us by D. G. E. D. Rogers. The 
proof of (16.6) is due to P. Diaconis and D. E. Knuth (private 
communication). 

The relation (16.7) was announced without proof by R. Bricard 
(1901). 
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Latin squares 


A Latin square of order n is a quadruple (R,C,S;L) where R, C, 
and S are sets of cardinality n and L isa mapping L: Rx C—S 
such that for any 7 € R and x € S, the equation 


L(i,j) =x 


has a unique solution 7 € C, and for any 7 € C, x € S, the same 
equation has a unique solution 7 © R. That is, any two of 7 € R, 
j € C, x € S uniquely determine the third so that L(i,j) = 2. 
Elements of R are called rows, elements of C' are called columns, 
and elements of S are called the symbols or entries of the Latin 
square. A Latin square is usually written as an n x n array for 
which the cell in row 7 and column j contains the symbol L(i, 7). 
In Fig. 17.1 we have an example of a Latin square of order 5. 


d 
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Figure 17.1 


The terminology ‘Latin square’ originated with Euler who used 
a set of Latin letters for S. 

A quasigroup is a Latin square (X, X, X;0) with a common row, 
column, and symbol set X. Here we abbreviate the quadruple and 
simply denote the quasigroup by (X,o). The mapping o is now a 
binary operation on X and the image of (x,y) under o is denoted 
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by xoy. So as a special case, we obtain Latin squares that are the 
multiplication tables of groups. 


Problem 17A. If we fix the first two rows in Fig. 17.1, then there 
are many ways to fill in the remaining three rows to obtain a Latin 
square (in fact 24 ways). Show that none of these Latin squares is 
the multiplication table of a group. 


Note that if (R,C,S;L) is a Latin square and the mappings o : 
R-R,7:C3C7,27: S —S' are bijections, and if we define 
L'(a(t),7(j)) := a(L(i,7)), then (R’,C’, S’; L’) is a Latin square. 
The two Latin squares are called equivalent. So with this notion of 
equivalence, we may assume that a Latin square of order n on the 
set S := {1,2,...,n} has the integers from 1 to n in that order as its 
first row and as its first column. Note that two Latin squares that 
are normalized in this way and different, can still be equivalent. 

Sometimes we denote a Latin square as just ‘0: Rx C— S’. 

An orthogonal array OA(n,3) of order n and depth 3 is a 3 by 
n? array with the integers 1 to n as entries, such that for any two 
rows of the array, the n? vertical pairs occurring in these rows are 
different. Suppose that we have such an array. Call the rows r, c, 
and s, in any order. For any pair (7, 7), there is a k such that rz, = 7, 
c, = j. We make a square with entry s; in the i-th row and j-th 
column (for all i, 7). The definition of orthogonal array ensures that 
this is a Latin square and we can reverse the procedure. Therefore 
the concepts of Latin square and orthogonal array are equivalent. 
Fig. 17.2 shows an OA(4,3) and two corresponding Latin squares. 

Two Latin squares for which the corresponding orthogonal arrays 
have the same three rows (possibly in different order) are called con- 
jugates. For example, one of the conjugates of a Latin square (with 
R= C) is its transpose. As an exercise the reader should write out 
the six conjugates of the Latin square in Fig. 17.2. Two orthogonal 
arrays are called isomorphic if one can be obtained from the other 
by permutations of the elements in each of the rows and permuta- 
tions of the rows and columns of the array. Two Latin squares for 
which the corresponding orthogonal arrays are isomorphic are also 
called isomorphic. This means that one is equivalent to a conjugate 
of the other. 
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3/2/41] 1 
rows 1111222233334444 ilalals 
columns 1234123412341234 AC SENC 
symbols 3241142343122134 

2/1/3/4 

2/4/3/1 
symbols 1111222233334444 Elael os 
columns 1234123412341234 iia)alo 
rows 3241142343122134 

3/2/1/4 

Figure 17.2 


Problem 17B. Consider the two Latin squares obtained from Fig. 
17.3 by setting a = 1, b = 2, respectively a = 2, b= 1. 
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Figure 17.3 


Show that these two squares are equivalent. Show that a square 
that is not equivalent to these corresponds to the cyclic group of 
order 5. 


One can also interpret a Latin square as a three-dimensional 
array of 0’s and 1’s with exactly one 1 in each line of the array that 
is parallel to one of the sides. For example, in Fig. 17.3 the entry 
4 in row 2 and column 3 should then be interpreted as an entry 1 
in position (2, 3,4) of the array. 

In a later chapter we shall consider orthogonal arrays with depth 
greater than 3. These are often much more difficult to construct 
than the arrays OA(n, 3). 


Problem 17C. Construct an OA(4,4), ic. a 4 by 16 matrix A 
with entries 1,2,3,4, such that for any two rows of A, say row 7 and 
row j, the 16 pairs (aix, a;,), 1 < k < 16, are all different. 
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A sub-Latin square (or subsquare) of a Latin square (R,C,S;L) 
is a Latin square (Rj, Ci, $1; £1) with Ry CR, CL CC, S, CS, 
and L,(i, 7) = L(i,7) for (4,7) € Ry x Ci. 


Problem 17D. Let m and n be positive integers, m < n. Show 
that m < sn is a necessary and sufficient condition for the existence 
of a Latin square of order n containing a subsquare of order m. 


The following problems are all of a similar nature. They concern 
a problem similar to the situation of Problems 17B and 17D, namely 
the completion of a square that has been partly filled. Ann x 
n array A with cells which are either empty or contain exactly 
one symbol is called a partial Latin square if no symbol occurs 
more than once in any row or column. (It might be better to say 
that the array ‘has the Latin property’, but we will use the former 
terminology.) We are interested in conditions that ensure that a 
partial Latin square can be completed to a Latin square of order n, 
i.e. when the empty cells can be filled with symbols so that a Latin 
square is obtained. For example, if A is a partial Latin square with 
filled (i.e. nonempty) cells everywhere except in the last row, there 
is clearly a unique way to complete A to a Latin square. On the 
other hand, the two partial Latin squares of Fig. 17.4 are clearly 
not completable. 


Figure 17.4 


To a partial Latin square we can associate an array in the same 
way that we did for Latin squares by only considering filled cells. 
This allows us to define the conjugate of a partial Latin square. 
The two partial Latin squares in Fig. 17.4 are not really essen- 
tialy different but are conjugates, as is evident from their ‘partial 
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OA(5, 3)’s’ shown below: 


rows ile ale 12345 
columns (:2545] (s2845] 
symbols 12345 11112 

If the first & rows of a partial Latin square (k < n) are filled 


and the remaining cells are empty, then A is called a k x n Latin 
rectangle. 


Theorem 17.1. Ak x n Latin rectangle, k <n, can be extended 
toak+1 x n Latin rectangle (and hence it can be completed). 


PROOF: Let B; denote the set of positive integers that do not occur 
in column j of A. Each of the numbers 1 to n occurs k times in 
A and therefore n — k times in the sets B;. Any I of the sets B; 
together contain /(n — k) elements, and therefore at least | different 
ones. So the sets Bj, 1 < 7 <n, have property H of Chapter 5, 
and by Theorem 5.1 they therefore have an SDR. This SDR can be 
adjoined as the (k + 1)-st row of the Latin rectangle. 


We denote by L(n) the total number of different Latin squares 
of order n. The exact value of L(n) is known only for n < 9. 
The numbers grow very rapidly, e.g. although there are only two 
inequivalent Latin squares of order 5, there are 5! - 4! - 56 different 
ones. 


Theorem 17.2. L(n) > (n!)2"/n™. 


PROOF: To construct a Latin square of order n, we can take any 
permutation of 1 to n as the first row. If we have constructed a 
Latin rectangle with k rows, then by the proof of Theorem 17.1, 
the number of choices for the next row is per B where b;; = 1 if 
i € B;. By Theorem 12.8 (the Van der Waerden conjecture) this 
permanent is at least (n —k)"-n!/n". So we find 


n—-1 


(17.1) L(n) > TT {(n = b)'nl/n"} = (nl)? /n™. 


k=1 
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Remark. If we had applied Theorem 5.3, we would have found 
the estimate L(n) > n!(n — 1)!---1! (H. J. Ryser, 1969), which is 
better for small n but asymptotically much smaller than (17.1). 


For n = 8, the estimate (17.1) is less than 10~+ times the true 
value but asymptotically (17.1) is best possible as the next theorem 
shows. 


Theorem 17.3. If we define £(n) := {L(n)}/"", then 
L(n)~e?n for n = oo. 


Proor: From (17.1) and Stirling’s formula we find n~'L(n) > e7?. 
By the same argument as in the proof of Theorem 17.2, and by 
estimating per B using Theorem 11.5, we find 


L(n) < [Lm k) < Tle". 
k=1 k=1 


Hence, again using Stirling’s formula with C some constant > 27, 
we have 


log £L(n) 


IA 


heel 
— > posh! 
Lara 


gens i 1 

—S {logk ~ 1+ 5¢logk-+ Zlogc } 
nm 

k=1 


IA 


2k k 


1 n 
—S “log k —1+0(1) 
n 
k=1 
= —2+logn+o(1) for n — o. 


Combining this with the lower bound, we find the result. 


The following theorem due to H. J. Ryser (1951) is a generaliza- 
tion of Problem 17D. 


Theorem 17.4. Let A be a partial Latin square of order n in 
which cell (i, 7) is filled if and only ifi < r and j < s. Then A can 
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be completed if and only if N(i) >r+s—n fori=1,...,n, where 
N(i) denotes the number of elements of A that are equal to 1. 


PROOF: First, observe that in a Latin square of order n, the first 
r rows contain exactly r elements equal to 2 of which at most n—s 
are in the last n — s columns. So the condition on N(‘) is trivially 
necessary. We show that the condition is also sufficient. Let B 
be the (0,1)-matrix of size r x n with b;; = 1 if and only if the 
element j does not occur in row 7 of A. Clearly every row of B has 
sum n—s. The j-th column of B has sum r — N(j) < n—s. By 
Theorem 7.5 (with d:=n— s) we have 


B= L@ty) ee L™ 


where each L is an r x n (0,1)-matrix with one 1 in each row 
and at most one 1 in each column. 

As an example, suppose r = s = 4, n = 7, and the first four rows 
of A are 


1)/2/3)4 
5/3]/11/6 
3/1)5)}2 
7/4/2)5 
Then e.g. 
0000111 
0101001 - 
BAS Wena a’ =L9O4LO4109= 
1010010 
0000001 0000010 0000100 
0001000 0000001 0100000 
0000010] + | ooo1000]~* | o000001 
1000000 0010000 0000010 


Say LY = an Then we fill the cell in position (i,7) of A, 
Pe Wei) = Saas DOR if 1? = 1. In our example, we 


would fill in the last three columns with 
t 6 


Dal Nm] ot 


4 7 
6 4 
1 3 
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Thus A is changed into a partial Latin square of order n with r 
complete rows, i.e. a Latin rectangle. By Theorem 17.1 this can be 
completed to a Latin square of order n. 


The examples of partial Latin squares that cannot be completed 
that were given in Fig. 17.4 both have n filled cells. The conjecture 
that a partial Latin square with less than n filled cells is completable 
to a Latin square was known as the Evans conjecture until it was 
finally proved by B. Smetaniuk (1981). Perhaps the most important 
part of Smetaniuk’s proof is the construction of Theorem 17.5. 


Theorem 17.5. Let A be a Latin square of order n. Let B be an 
n+1 x n+ 1 array whose (i,j)-entry is the (i,7)-entry of A for 
i,j >1,i+ 9 <n+1, which has a new symbol a on the back- 
diagonal, and whose cells below the back-diagonal are empty. Then 
B can be completed to a Latin square of order n + 1. 


An example of a Latin square A of order 5 and the corresponding 
array B are shown below. 


1/2;)3)/4/5]a 
1)/2/3]4)5 

4})3]5 a 
4}3]/5]/1)|2 

2/;5)/1/a@ 
2)5/1/3)4 

5} 1}a 
od} 1)4/2)3 

3/Q@ 
3}/4/2/5)] 1 

Qa 


PRooF: Let C' be the n x n array obtained from B by dropping 
the last row and column. In our example, C’ is the array below. 


1/2/3)4/5 
4)}3/5;lja 
2/;5)/1l)/a@ 
5}1l])a 
3/Q@ 


We describe an algorithm that fills in the empty cells of C row 
by row subject to the constraint that after the first r rows are 
filled in, we will have r x n a Latin rectangle on n + 1 symbols 
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1,2,...,n,a@ so that the 7-the column contains the same symbols, 
in some order, as there were in the first r rows of A, except for 
one so-called ‘missing’ symbol in each of the columns containing 2, 
and such that the r — 1 missing symbols are all distinct. This is 
trivial for r = 1 and r = 2. Assume this has been done for some r, 
2<r<nand that the missing symbols at this point are 


Ty—r425Un—r43)+++5Un- 


To fill in the (r + 1)-st row, we proceed as follows. 
Let the last r symbols in row r+ 1 of A (that have been removed 
in constructing C’) be 


Yn—rt ls Yn—rt2s Yn—rt35-++>Yn- 


Here y,_;+41 is the symbol displaced by a and must be the new 
missing element in that column. We may consider filling in row 
r+1 with 

Yn-rt2) Yn—-r4+35+++5 Uns 


and this is satisfactory if 


Yn—r41) Un—r425 Un—r43)+++5En 


(which would be the new missing symbols) are distinct. If not, 
consider the sequence 


Yn-r4+1 = Vk, 


Yk, = Uke 


Ykm—1 = Lkm 


extended as far as it can go, i.e. so that yz, is not equal to any 
of the z,;’s. We then fill in row r +1 with Yn—r+2, Yn—r+3, +++ 5 Yn 
except that Yx,,Yko,---+Yk, are omitted and replaced, respectively, 
DY Lky,Lko+-++,Lk,,- It must be checked that the entire row r+ 1 
contains distinct sysmbols. The new missing symbols will be yp_,+1 
and fn—r42, In—r43,---,%n except that x4,,2%,,.--, Xk, are omitted 
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and replaced, respectively, by yx,,Yko;--+-+Yk,,- It must be checked 
that these are distinct. 

Once the empty cells of C are filled, the first n cells of the last 
row of B are to be filled with x and the missing sysmbols (when 
r =n). We now have ann+1 x n Latin rectangle, which completes 
uniquely to a Latin square of order n + 1. 


Problem 17E. Let A denote the Latin square of order 10 on the 
left below, and let C’ be the partial 10 by 10 Latin rectangle on 
the 11 symbols 0,1,2,...,9,a@ on the right (not including the last 
row and column). Use the algorithm of the proof of Theorem 
17.5 to complete C' to a 10 by 10 Latin rectangle on the sym- 
bols 0,1,2,...,9,a. Then complete this Latin rectangle to a Latin 
square of order 11. 


NOTAINOAWAOH 
NIN OAMODOrFRPA TOW 
COoROMOrRPWHIAWMN 
WonrRANTMIORDS 
FOooOoOMaRNWAD 
COrRrANUMUANNWO 
ArPAtwWwooeonww 
oN BR WOROONE CO 
BROWN OORrFON 
Sownoanw vA oer 
SmMamoorrAw OW 
S0OrRPWNANMH 


4 
5 
9 
1 
2 
f 
6 
8 
3 
0 


QRIODWOoOOANNrR OTEK 


(For example, the algorithm fills in the first five rows of C as 
indicated on the right below. The elements ‘missing’ from the sev- 
enth through tenth columns at this point are, respectively, 5,9, 2,0. 
Now consider the sixth row. The a in column six forces 8 to be 
the missing element from column six. But 8 is distinct from the 
elements currently missing from the other columns. So there is 
nothing more to do; we just fill in row six with the entries 2,3, 6,5 
from the sixth row of A.) 
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413596028 7 4135960287 
589267341 0 508-2 6k BL a 
9456037821 945603 78al1 
13475289 0 6 134752 8a0 6 
261374509 8 261374a09 8 
7901482365 790 14a 


Problem 17F. 

(i) Let A and B be Latin squares of order n on symbols 1, 2,..., 7. 
Let A’ and B’ be the Latin squares on symbols 1,2,...,2+1 (with 
n+ 1’s on their back diagonals) that agree with A and B, respec- 
tively, above their back diagonals as constructed in the proof of 
Theorem 17.5. Show that if A and B are distinct, then A’ and B’ 
are distinct. This proves that the number of Latin squares of order 
n +1 with n+ 1 on the back diagonal is greater than or equal to 
the number N(n) of Latin squares of order n. 

(ii) Explain how part (i) leads to the result 


L(n) > n(n — 1)!--- 21! 


mentioned in the Remark following Theorem 17.2. 


Theorem 17.6. A partial Latin square of order n with at most 
n — 1 filled cells can be completed to a Latin square of order n. 


PROOF: The theorem will be proved by induction. Assume it holds 
for n and let L be a partial Latin square of order n+ 1 with at most 
n filled cells. 

First, suppose there is a symbol x of L that occurs only once in 
L. We propose to permute rows and columns to move the filled 
cells above the back-diagonal, except that the single symbol x will 
be be on the back diagonal. Say that the rows with filled cells have 
fi, fo,..-, fr filled cells, where f; + ---+ fe <n, and where x is 
in the row with f; filled cells. Put the row with f; filled cells in 
position n+ 1 — f;; permute columns to push the filled cells to the 
left. Put the row with f> filled cells in position n +1 — f; — fo; 
permute columns, other than the first f;, to move the filled cells 
to the left; the first f; + fo cells in that row will contain the filled 
cells. Put the row with fs filled cells in position n+1— ff; — fo— fs, 


17. Latin squares 193 


etc., and continue until all filled cells are above the back-diagonal. 
Then, if the cell containing x in row n+ 1 — f; is in column J, 
interchange column j and column f; + 1. 

By the induction hypothesis, the part above the back diagonal, 
which has at most n — 1 filled cells, can be completed to a Latin 
square A of order n based on symbols other than x. The array B as 
described in the statement of Theorem 17.5 contains the original 
filled cells of a partial Latin square that differs from DL only by 
row and column permutations. The theorem says that B can be 
completed, and hence so can L. 

At this point we can also assert that DL can be completed if there 
is a row that contains exactly one filled cell, or a column that 
contains exactly one filled cell, because a conjugate of L will have 
the property that some symbol appears exactly once (for example, 
every row of the partial square on the right of Fig 17.4 contains 
exactly one filled cell, and its conjugate on the left has the property 
that every symbol occurs exactly once) and a conjugate of L can 
be completed if and only if LD can be completed. 

So we may now assume that no row or column contains exactly 
one filled cell. Thus the filled cells are contained in at most m rows 
and m columns, where m := |n/2|. We may permute rows and 
columns so that all filled cells lie in the upper left subarray of order 
m. Theorem 17.4 implies that every m x m Latin rectangle can be 
completed to a Latin square of order n+1. So it will suffice to fill in 
the unfilled cells of the the upper left subarray of order m to get a 
Latin rectangle. But this is easy, as we have n+ 1 symbols available 
and we may proceed in any order, filling in a cell of the subsquare 
with a symbol not yet used in the row or column containing the 
cell. 


We now come to a generalization of the idea of a Latin square. 
Suppose we are given a collection C of n-sets C(i, 7), 7,7 =1,2,...,n. 
One can ask the question whether there exists a matrix A of size 
nm xX n in which in each row, respectively column, all the entries aj; 
are different and for which aj; € C(i,j) for all ¢ and 7. If all the 
n-sets are {1,2,...,n}, the answer is yes since any Latin square is 
a solution. If the elements of C are pairwise disjoint, the question 
is trivial. It is perhaps surprising that the increased freedom of not 
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requiring the n-sets to be the same makes the question harder. In 
fact, an analog of Theorem 17.1 does not hold in this case. Until it 
was proved by F. Galvin (1995), the conjecture that such a matrix 
A indeed exists was known as the Dinitz conjecture. 

We shall use the terminology of colorings of graphs (a problem 
treated extensively in Chapter 33). Let G be a simple graph. A 
list assignment of G is a function C that assigns a set C(v) to each 
vertex v of G. A C-coloring of G is a function A defined on the 
vertex set such that A(v) € C(v) and such that A(v) and A(w) are 
different if v and w are adjacent in G. 


Let f be a function from the vertex set of G to the integers. We 
say that G is f-colorable if a C-coloring of G exists for every C 
with the property |C(v)| > f(v) for all v € G. If k is an integer, 
a graph is called k-list-colorable if G is f-colorable for the constant 
function f(v) =k. Clearly, if G is k-list-colorable, then k is at least 
the chromatic number y(G), defined in Chapter 3. 

The line graph L = L(G) of a simple graph G is the simple graph 
with vertex set V(L) := E(G) and where a,b € E(G) are adjacent 
as vertices of L if and only if, as edges in G, a and b have a common 
incident vertex. Now, consider the line graph Lo(n) of Kn» (see 
Chapter 21). It has n? vertices {i,j}, i,j = 1,2,...,n, where 
pairs are joined if and only if they have an element in common. 
The Dinitz conjecture can be formulated as follows: Lo(n) is n- 
list-colorable. The theorem proved by Galvin states that for a 
bipartite graph G, the line graph L = L(G) is x(L)-list-colorable. 
Since Latin squares exist, L2(n) has chromatic number n and the 
Dinitz conjecture is therefore true. 


We shall prove two propositions from which the theorem will 
follow. 


A subset K of the vertex set of a digraph D is called a kernel of 
D if no two vertices of K are joined by an edge and for every v ¢ K 
there is a w € K such that there is an edge in D from v to w. 


Proposition 17.7. Let D be a digraph for which every induced 
subgraph has a kernel. Let f(v) := 1+outdegree(v) for every vertex 
v of D. Then D is f-colorable. 
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PROOF: We use induction on the number of vertices. Small cases 
are easily checked. (Note that the only digraph on three vertices 
for which the condition and the assertion of the problem do not 
hold is the directed circuit.) 


Let C be a list assignment of D with |C(v)| > f(v) for all v. Pick 
an element x from one of the sets in C’ and consider all vertices 
v such that « € C(v). We call the induced subgraph on these 
vertices D,. By our assumption, D,; has a kernel K. Let D2 be 
the induced subgraph on the vertices in D\K. For a vertex v ¢€ K 
define C’(v) := C(v)\{x}. By the induction hypothesis Dz has a 
C’-coloring A’. If we define A on D by 


| AO) aE Oe, 
Atv) = { 4 if vek, 


then A is clearly a C-coloring of D. 


For the next proposition we define an orientation of a graph G 
to be normal if every clique in G is a transitive tournament. 


Proposition 17.8. Every normal orientation of the line graph L = 
L(G) of a bipartite graph G has a kernel. 


PROOF: We use induction on the number of edges of G. For small 
numbers, the statement is easily checked. Suppose V(G) = X UY 
(and all edges between X and Y). We may assume that every 
vertex of G has a positive degree. Let D be a normal orientation 
of L(G). The cliques in L(G) correspond to vertices in X or in Y. 

For every vertex x € X, we consider the corresponding clique 
in L(G) and define e(x) := {x, y(x)} to be the sink of this clique 
(with respect to D). First, assume that for every x € X, this vertex 
e(a) of L(G) is the source of the clique defined by y(x) € Y. It is 
then obvious that all the y(x) for x € X are distinct. Therefore 
K := {{z,y(x)} : x € X} is a kernel of L(G). 

Next, assume that for some x € X, e(x) := {x,y(x)} is the sink 
of the clique corresponding to x but that e’ := {2’,y(x)}, where 
x’ # x, is the source of the clique corresponding to y(x). Remove 
the edge {x’, y(x)} from G. The resulting line graph has a kernel 
by the induction hypothesis. We claim that K is also a kernel for 
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D. We have to show that there is an edge from e’ to K. Ife(x) € K 
we are done (because e’ is a source). However, if e(x) ¢ K, then 
there is an edge from e(z) to some element in AK, which must be 
of type {x”, y(x)} by definition of e(x). Because the tournament 
defined by y(x) is transitive, we are again done. 


We can now prove the main result. 


Theorem 17.9. IfG is a bipartite graph, then L = L(G) is x(L)- 
colorable. 


PROOF: Let G be a bipartite graph on the vertex set X UY. Let g 
be a coloring of the vertices of L = L(G) with colors 1,2...,x(L). 
We orient L to form a digraph D as follows: if e: = {71,y} and 
€2 = {£2, y2} are adjacent vertices of L and g(e1) < g(ez), then 


{ the edge is from e, to eg if x; = Xo, 


the edge is from e2 to e; if y, = yo. 


Since every clique in L corresponds to a vertex x € X or y € Y, this 
orientation is normal. By Proposition 17.8 every induced subgraph 
of D has a kernel. Clearly, 1+ outdegree(e) < x(L) for every vertex 
e € D. So, by Proposition 17.7 the proof is complete. 


Problem 17G. 

(a) Let n be even. Find a permutation 21, 2,...,2,, of the ele- 
ments of Zn such that the differences Li41 — Xj, 1 <i <n, are all 
different. 

(b) Show that this is not possible if n is odd. 

(c) Consider the permutation of (a). Define aj; := x; + 2;. Show 
that the square with entries a;;, 1 < 7,7 <n, is a Latin square that 
has the following property: The n(n — 1) adjacent pairs (a;;, a; 41) 
are different. Such a square is called row-complete. This square is 
also column-complete. 


Problem 17H. Describe a one-to-one correspondence between 
symmetric Latin squares of order n in which all symbols appear 
on the main diagonal and symmetric Latin squares of order n + 1 
with all (2+ 1)’s on the diagonal. 
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Problem 171. Let 1,2,...,n be the first row of a Latin square 
and 2,3,...,n,1 the second row. How many candidates are there 
for the third row? 


Problem 17J. Let 1,2,...,2n—1,2n be the first row of a Latin 
square of order 2n. Let 2,1,4,3,...,2n,2n—1 be the second row. 
(i) Find a formula for the number of candidates for the third row. 
(ii) Give a permanent that has this number as value. 


Notes. 

Although Latin squares have been studied for more than 200 
years, much of the material in this chapter is recent. Theorem 17.1 
is the oldest; it is due to M. Hall, Jr. (1945). 

Theorem 17.2 and the remark following it can be found in H. J. 
Ryser (1969). At the time that paper was written, the Van der 
Waerden conjecture was still open. 

Theorem 17.3 is new. 

Theorem 17.4 is again due to H. J. Ryser (1951). 

The so-called Evans conjecture occurs as a question (not as a 
conjecture) in a paper by T. Evans (1960). Before the proof by B. 
Smetaniuk appeared, several partial results were published. 

Our treatment of the Galvin proof of the Dinitz conjecture is 
based on some unpublished notes of D. Hoffman. 

The most extensive treatment of Latin squares can be found in 
the book Latin Squares by J. Dénes and A. D. Keedwell (1991). 
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Hadamard matrices, 
Reed—Muller codes 


Hadamard considered the following question. Let A be an n x n 
matrix with real entries of absolute value at most 1. How large can 
the determinant of A be (in absolute value)? Since each row of A 
is a vector with length < ./n, the determinant cannot be larger 
than n”/?. (The absolute value of the determinant is the n-dimen- 
sional volume of the parallelopiped spanned by the row vectors in 
Euclidean n-space.) Can equality hold? In that case all entries 
must be +1 or —1 and furthermore any two rows must be orthog- 
onal, i.e. they have inner product 0. This leads to the following 
definition. 

A Hadamard matrix of order n is ann xX n matrix H with entries 
+1 and —1, such that 


(18.1) HH' =nl. 


Of course, any two columns of H are also orthogonal. This property 
does not change if we permute rows or columns or if we multiply 
some rows or columns by —1. Two such Hadamard matrices are 
called equivalent. For a given Hadamard matrix, we can find an 
equivalent one for which the first row and the first column consist 
entirely of +1’s. Such a Hadamard matrix is called normalized. 
Clearly the remaining rows (if any) have as many +1’s as —1’s, i.e. 
if n #1 then n must be even. Some small examples are 


+ + + 
oe ae 
ee Se 
- - + 


++4++ 
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where, in the last example, we have only indicated the sign of the 
entry. 


Problem 18A. Show that any two Hadamard matrices of order 
12 are equivalent. 


Theorem 18.1. If H is a Hadamard matrix of order n, then n = 1, 
n = 2, orn =0 (mod 4). 

PRooF: Let n > 2. Normalize H. We can permute columns in 
such a way that the first three rows of H become 


foe bH fH HH FH HH FH HH 
++4+---+4+ ++.---+-+4 _— Ee —— a eee ae 


++.---+4 ae i te ge ++.---++ a eet ee 
sw” a ae 
a columns b columns c columns d columns 


We have a+6+c+d =n and the three inner products formed 
from these rows yield a+ b—c—d=0,a—b+c—d= 0, and 
a—b—c+d= 0. If we add these equations, we find n = 4a, proving 
the theorem. (In a similar way we see that 4b = 4c = 4d = n.) 


One of the famous conjectures in the area of combinatorial de- 
signs states that a Hadamard matrix of order n exists for every 
n = 0 (mod 4). We are still very far from a proof of this conjec- 
ture. The smallest n for which a Hadamard matrix could exist but 
no example is known is presently 428. There are very many con- 
struction methods for Hadamard matrices, of which we shall treat 
a few. First we define a second class of matrices, very similar to 
Hadamard matrices. 

A conference matrix C of ordern is ann X n matrix with 0’s on 
the diagonal, +1 or —1 in all other positions and with the property 


(18.2) CO’ =(n—1)I. 


The name conference matrix originates from an application to con- 
ference telephone circuits. V. Belevitch (1950) studied so-called 
ideal nondissipative networks consisting of ideal transformers, to 
be used to set up a conference telephone network. The theory led 
to a necessary condition for the existence of such a network, namely 
the existence of a conference matrix of order n, where n is the num- 
ber of terminals of the network. This explains the name of these 
matrices. 
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Problem 18B. Let C be a conference matrix of order n ¥ 1. 
Show that n is even. Show that by permuting rows and columns, 
and multiplying certain rows and columns by —1, we can find an 
equivalent conference matrix that is symmetric if n = 2 (mod 4), 
and antisymmetric if mn =0 (mod 4). 


Theorem 18.2. If C is an antisymmetric conference matrix, then 
I+C is a Hadamard matrix. 


Proor: ([+C)([+C)' =I1+C+C'+CC!' =I4+(n-1)l=nl. 


Theorem 18.3. IfC is asymmetric conference matrix of order n, 


then 
H- I+C -I+C 
SS Nee ACS. ef ay 


is a Hadamard matrix of order 2n. 


Proor: Again calculate HH' and the result follows. 


One of the most common construction methods for several com- 
binatorial structures is the so-called recursive method in which a 
large object of the required type is made by applying some pro- 
cedure to two or more smaller objects of the same type. In the 
next theorem we demonstrate such a method for the construction 
of Hadamard matrices. 

Let A be an m x n matrix with entries a;; and B another matrix. 
The matrix 


ai,B aj2B exete AinB 
aB AB are A,B 
Ami B AmB tee sere eI 


consisting of mn blocks with the size of B is called the Kronecker 
product A ® B of the matrices A and B. 


Theorem 18.4. If H,, and H,, are Hadamard matrices of order m 
and n respectively, then H,, ® H, is a Hadamard matrix of order 
mn. 


PROOF: By straightforward calculation one sees that 


(A® B)(C ® D) = (AC) ® (BD) 
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and that 
(A®B)'=A' @B'. 


We take A = C = HH, and B = D = H,,. The result follows from 
the definition of a Hadamard matrix and the fact that I, @In = Inn. 


ene we find a 


By repeatedly applying this theorem to H2 := 
sequence of Hadamard matrices that we shall denote by H,,, where 
I ee es 

We now come to a direct construction method for conference 
matrices, and these can then be used with Theorems 18.2 and 18.3 
to construct Hadamard matrices. In the following, q is a power of 
an odd prime. On the field F,, we define a function y (a so-called 
character) by 


0 ite =, 
xe) s= <1 if x is a nonzero square, 


—1 if x is a nonsquare. 


For any x and y in F,, we have x(x)x(y) = x(ay), and since there 
are aS many nonzero squares as nonsquares we also have 


(18.3) SxS 


vel, 


Now let 0 Ac € F,. Then (18.3) implies that 


(18.4) S- x(b)x(b + ¢) = -1. 


bek, 


This is seen by ignoring the term with b = 0, which is 0, and then 
writing x(b +c) = x(b)x(1 + cb~'); (note that y(b)? = 1 if b 4 0). 
If b runs through all nonzero elements of the field, then 1+ cb"! 
takes on every value except 1. 

Number the elements of F, : 0 = ao,a1,...,@g-1. We define a q 
x g matrix Q by 


Gj =X(a—aj), O45 <q. 


18. Hadamard matrices, Reed-Muller codes 203 


Note that Q is symmetric if gq = 1 (mod 4), and antisymmetric 
if ¢g = 3 (mod 4). As a direct consequence of the elementary 
properties of y and of (18.4), we find that QQ' = qI — J and 
QJ = JQ =O. A matrix C of size q+1 x q+ 1 is defined by 


(18.5) Cee" ’ 
+1 


where the signs of the terms +1 are chosen in such a way that 
C' is symmetric or antisymmetric. From the properties of Q, it 
now follows that C' is a conference matrix of order q+ 1. This 
construction is due to Paley (1933) and the conference matrices of 
this type are usually called Paley matrices. For the special case 
that q is a prime, the matrix Q is a circulant. We summarize the 
constructions as a theorem. 


Theorem 18.5. If q is a power of an odd prime, then a Hadamard 
matrix of order q+ 1 exists if q = 3 (mod 4), and a Hadamard 
matrix of order 2(q + 1) exists if ¢ = 1 (mod 4). 


+$+tttttt+s++ 
a 
—4++4+-4+4+4---+ 
+-4+4+-444--- 
—+-4+4-4+4+4-- 
—--+-4+4+-44+4- 


—--+4+-44-44+4+ 


Figure 18.1 
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+ 


fe eres Ole a I 


Figure 18.2 


Figures 18.1 and 18.2 illustrate Hadamard matrices of order 12 
constructed from Paley matrices of orders 11+ 1 and 5+ 1, respec- 
tively. 


Problem 18C. Show that a Hadamard matrix of order n exists if 
n =0 (mod 4), n < 100, except possibly for n = 92. 


There was a period of thirty years between Paley’s result and the 
discovery of a Hadamard matrix of order 92 by L. D. Baumert, S. 
W. Golomb, and M. Hall. The method that they used was devel- 
oped by Williamson in 1944 but a computer search was necessary 
to find the actual matrix. Williamson’s method is based on the 
following observation. Let the matrices A;, 1 <i < 4, of order n,n 
odd, be symmetric and assume that they commute with each other. 
Consider the matrix H defined by 


Be Aye Ae ay 
SAG si tie. 2A 
SAS. Ap dA As 
At Sil, cae. oA 


(18.6) 


We then have 


(18.7) HH' = 1,@ (Aj + AS + Aj + Aj). 


18. Hadamard matrices, Reed—Muller codes 205 


To construct a Hadamard matrix in this way, we have to find ma- 
trices A; that satisfy the conditions stated above, have entries +1, 
and furthermore satisfy 


(18.8) Aj + A} + AZ + Aj =4nIn. 


Let U be the permutation matrix of order n corresponding to 
the permutation (12...n), ie. uj = 1 if and only if 7 -i = 1 
(mod n). Then U” = J and any circulant is a linear combination 
of powers of U. If we assume that the matrices A; have the form 
A; = a a;jU! with aj = 1 and ajj = ain—;, then the matrices 
do indeed commute and they are symmetric. From now on, we also 
assume that all the a;; are +1 and that (18.8) is satisfied. A simple 
example of a Hadamard matrix constructed in this way is found 
by taking n = 3, A, = J, and A; = J — 21, i = 2,3,4. We find a 
Hadamard matrix of order 12; see Fig. 18.3. 


| 
| + 
| 


++ 


—- 
++ 
++ 
++ 
++ 
— 


| +/+ 4+ 


+/+ 4 | 
+ 

J+ tt++4/+ 
| 


+44 


Figure 18.3 


Example 18.1. Note that our construction implies that A; has 
constant row-sum a; (odd) and by (18.8) we have aj+---+a4 = 4n. 
If we wish to use this method to find a Hadamard matrix of order 
20, we first write 20 as a sum of four odd squares: 20 = 9+9+1+1. 
From this we see that two of the matrices A; must be 2J — J and 
then it is not difficult to see that the other two have as first row 
+—++-, respectively + -+—-—-+. 
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To analyze the situation a little further, we introduce matrices 
W;, and P,; as follows: 


(18.9) QW; := (Ai +--+ + Ag) — 2Ai, 


(18.10) A; = 2P, — J. 


Our conventions on the coefficients a;; imply that if p; is defined 
by p;J := PJ, then p; is an odd integer. Furthermore (18.8) and 
(18.9) imply that 


(18.11) We +W2+W24+W? =4nl. 
By substituting (18.10) in (18.8), we find 


4 4 
(18.12) SOP =(S pi -n)J t+ al. 
i=1 


i=1 


Suppose the term U", k 4 0, occurs in a of the matrices P;. Con- 
sidering the equation (18.12) mod 2, we find U”* on the left-hand 
side with coefficient @ and on the right-hand side with coefficient 1. 
So a is odd and this implies that U" occurs in exactly one of the 
matrices W; (with coefficient +2). From Example 18.1 and (18.9), 
we see that the constant row-sums w; of the matrices W; satisfy 
wi +-:++ wi =4n. These facts reduce the number of possibilities 
for matrices W; also satisfying (18.11) sufficiently to make a com- 
puter search for such matrices feasible. The A; are then found from 
(18.9). We list the first rows of the four matrices of order 23 that 
produced the first Hadamard matrix of order 92: 


7 a ee ec a or Ce 
1 i a 
Fe a i oo ee eo ee 
Ai ahaa eae a a ee 


Problem 18D. Construct a Hadamard matrix of order 28 using 
Williamson’s method. 


We look at a different problem concerning Hadamard matrices. 
If a Hadamard matrix of order n is normalized, then it obviously 
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has exactly n more entries +1 than entries —1. We define the excess 
of a Hadamard matrix to be the sum of all the entries, and then 
define a(n) as the maximal value of the excess of all Hadamard 
matrices of order n. The following bound due to Best (1977) shows 
that a(n) grows as n3/?, 


Theorem 18.6. n?2-"(1",) < a(n) < ny/n. 
2 


PRooF: (a) Let H be a Hadamard matrix of order n. Let s; be 
the sum of the k-th column of H. Let c; be the 7-th row of H, 
1<i<n. We calculate >)1<; ;<,(Ci,¢j) in two ways. By the 
definition of Hadamard matrix this sum is n?. On the other hand 


nm nm 
> ) = 2 
CikCjk = Sp. 
1<aj<n k=1 k=1 


From this and the Cauchy-Schwarz inequality we find 


n 


So 5 < (no sf)? = avi, 
k=1 


k=1 


so a(n) < n/n. 

(b) Let x be any vector in {+1,—1}". We multiply column j of 
H by xj, 1 <j <n. Subsequently, multiply those rows that have 
more terms —1 than +1 by —1. Call the resulting matrix H, and 
define o(H,.) := )>y_, |(x, ¢;)|. Clearly o(n) is at least equal to the 
average value of o( Hx). So we find 


n 


o(n)>2" S° o(Ax) =2"S—S- | (x, ci)| 


x€{+1,-1}” x i=l 


= S>>> S- |n — 2d| 


i=1 d=0 x,d(x,c;)=d 


—n - n —n n 
=2 no in—2al(") = n%2 (;,,) 
d=0 
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Corollary. 27'/?n3/? < a(n) < n3/?. 


PRooF: That the left-hand side of the inequality in Theorem 18.6 
is asymptotically equal to 2!/22~!/2n3/? follows from Stirling’s for- 
mula. To obtain an inequality that holds for all n, we must replace 
the constant by 27!/?. 


Example 18.2. Consider a square of size 4 by 4, divided into 
16 cells that we number from 1 to 16. A matrix A with rows a;, 
1 <i < 16, is defined by taking a;; = —1 if 7 occurs in the square 
in the same row or column as 7, but 7 4 7; otherwise a;; = 1. For 
any two rows of A, there are two positions where they both have 
entry —1. It follows that A is a Hadamard matrix of order 16 with 
excess 64, i.e. for this matrix we have equality for the upper bound 
in Theorem 18.6. 

This Hadamard matrix can also be constructed as follows. Define 
HT := J —2I, a Hadamard matrix of order 4 with maximal excess, 
namely 8. Every row of H has the same number of terms +1, 
namely 3. The matrix H ® H is a Hadamard matrix of order 16, 
with 10 terms +1 in each row, and the maximal excess 64. If we 
take the Kronecker product of this matrix with H, we again find a 
Hadamard matrix with constant row-sums and maximal excess, etc. 
In general, a Hadamard matrix of order 4u?, all row-sums equal to 
2u, and hence maximal excess, is called a regular Hadamard matrix. 
Other constructions of such matrices are known. 


One of the very interesting and successful applications of Had- 
amard matrices was their use as so-called error-correcting codes. 
Many readers will have seen the excellent pictures that were taken 
of Mars, Saturn and other planets by satellites such as the Mariners, 
Voyagers, etc. To transmit such a picture to Earth, it is first di- 
vided into very small (pixels) and for each such square the degree 
of blackness is measured and expressed, say in a scale of 0 to 63. 
These numbers are expressed in the binary system, i.e. each pixel 
produces a string of six 0’s and 1’s (bits). The bits are transmitted 
to the receiver station on Earth (the Jet Propulsion Laboratory 
at Caltech). Due to a number of sources of noise, one of which 
is thermal noise from the amplifier, it happens occasionally that a 
signal that was transmitted as a 0, respectively a 1, is interpreted 
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by the receiver as a 1, respectively a 0. If each sextuple correspond- 
ing to a pixel is transmitted as such, then the errors made by the 
receiver would make the pictures very poor. Since there is only a 
limited amount of time to transmit the pictures and it is known 
how much energy is available from the solar panels, we know the 
average energy per user bit available to generate signals to be sent 
to the receiver. From this we can calculate the error probability p 
(per bit). 

Suppose that for a useful picture, each sextuple may be wrong 
with a probability Pg at most 1074. To achieve this by simply 
transmitting the bits, we need p ~ 10~*/6 since Pg = 1—(1—p)° = 
10-*. Let us first assume that the required energy is available 
and try to improve the picture by doing the following. Instead 
of sending a bit, say 0, we could send five 0’s and the receiver 
translates a received fivetuple into the bit that occurs most. We say 
that we are using a code, called the repetition code, with two words 
of length 5, namely 00000 and 11111. The number 1/5 is called 
the information rate of this code. The power constraint allows us 
to calculate how much energy we have available per channel bit. 
Subsequently, it is possible to calculate the new error rate p’ (per 
channel bit). Of course p’ > p and in our example we in fact have 
p’ = 0.035 (nearly 2000 times as large as without coding). The 
coding only makes sense if error correction more than makes up for 
the loss of energy per bit. 

The probability that a transmitted pixel is received correctly now 
becomes 

[(1 — p')? + 5p'(1 —p’)* + 10’)? — p')]?° 
and this is © 0.997. In other words, we have completely spoiled 
our picture! 

Let us now look at what was actually done in the Mariner 1969 
expedition. The 64 possible information strings (corresponding to 
the possible degrees of blackness of a pixel) were mapped onto the 
rows of the matrices H32 and —H32. Since we now have codewords 
of length 32 representing information words of length 6, the infor- 
mation rate is 6/32, nearly the same as for the repetition code. 
Note that any two of these codewords either differ in all 32 posi- 
tions or differ in exactly 16 of the positions. It follows that if a 
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received word contains at most 7 errors, then it resembles the cor- 
rect word more than any of the 63 other words. (We have changed 
our symbols to +1 instead of 0 and 1.) We say that this code is a 7- 
error-correcting code. The new error probability now is p’ + 0.036 
(again larger). The probability that a received word is decoded 
incorrectly now is 


> (? era 


i=8 


and this is roughly 1.4-10~° which is an order of magnitude better 
than Pr. 

In practice, there was not enough energy to transmit good pic- 
tures but by using this code, that problem was solved. 

What we have seen is one example of a sequence of (low rate) 
codes, known as first order Reed—Muller codes. Consider once again 
the construction of the Hadamard matrices H,,, where n = 2”. In 
Fig. 18.4 we show Hg. 

In the matrix H, we replace each +1 by 0 and each —1 by 1. 
Number the rows from 0 to n — 1 = 2” — 1. What happens in the 
Kronecker product construction when we go from m to m+ 1, i.e. 
double the order of the matrix? The new rows (now interpreted as 
vectors in F?”) have the form (c;,¢;) for 0 < i < n and they have 
the form (c;,c¢; + 1) for n < i < 2n, where 1 denotes the all-one 
vector. 


= 
—- 
a 
—- 
= 
—- 
+ 
a 


Figure 18.4 


Theorem 18.7. Let R'(1,m) denote the set of row vectors in F;’ 
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obtained from H,, as described above. Then R’/(1,m) is an m- 


dimensional subspace of F;}. 


PROOF: For m = 1, the assertion is trivial. We proceed by induc- 
tion. Let vj, vo,...,Vm be a basis of R’(1,m). Our observation 
made above shows that R’(1,m +1) consists of all linear combina- 
tions of the vectors (v;,v;) and the vector (0,1). This completes 
the proof. 


We now define the first order Reed—Muller code R(1, m) of length 
n = 2” and dimension m+ 1 to be the subspace of Fy’ spanned 
by the space R’(1,m) and the all-one vector of length n. In this 
terminology, the code used by Mariner 1969 was R(1,5). From 
the properties of Hadamard matrices, we immediately see that any 
two codewords in R(1,m) differ in at least 5n places. We say that 
the code has minimum distance d = 2"~'. Therefore the code can 
correct up to 2'~? — 1 errors. 

We can give the codewords a nice geometric interpretation. Let 
us number the points of the space Fj” by considering a vector as 
the binary representation of its number. For example, (0,1, 1,0, 1) 
is considered as the point Po in Re Let these representations be 
the columns of an m x n matrix. Let v; be the 7-th row of this 
matrix, 1 <7< m. Then from Fig. 18.4 and the observations 
above, we see that the vectors v; are the natural basis for R’(1,m). 
The basis vector v; is the characteristic function of the hyperplane 
{(@1,%2,.--,2m) € KF”: x; = 1}. By taking linear combinations, we 
see that the codewords of R(1,m) are exactly all the characteristic 
functions of the affine hyperplanes in the vector space and the char- 
acteristic function of the space itself (for 1) and of the empty set 
(for 0). Any two affine hyperplanes are either parallel or they meet 
in an affine subspace of dimension m — 2, in accordance with the 
fact that two rows of the Hadamard matrix have the same entries 
in exactly half of the positions. 

As our definition of R(1,m) suggests, Reed—Muller codes of higher 
order are also defined in the theory of error-correcting codes. They 
also have a geometric interpretation that we shall not go into here 
(but see Example 26.4). 


Problem 18E. For n = 2” and 1 <i <™m, we define the matrix 
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Mi; by 
Mo = Lgm-i ® Hy ® Ipi-1. 


Prove that 
H, =MOmM® ... mim, 


n n nm 


A received word x is decoded by calculating xH,). If there are not 


too many errors, then all entries of this product will be nearly 0 
except one with an absolute value close to n, telling us what the 
true message was. A multiplication by +1 is called an operation. 
Compare the number of operations that are necessary for decoding 
when H,, is used and when the representation of H, as a product 
of matrices MM, is used. (This is an example of what is known as 
a Fast Fourier Transform.) 


Problem 18F. Let v;, 0 < i < m, where vp = 1, be the basis 
ng 


of R(1,m) given above. We consider the subspace R(2,m) of EF, 
where n = 2”, spanned by all the vectors 


VirVjii= (vi0v;0, tee Vig) 


What is the dimension of this space? Show that any two vectors of 
R(2,m) differ in at least in places. 


Problem 18G. Suppose M is an m x n (0,1)-matrix so that the 
Hamming distance between any two distinct rows is at least d. (If 
M is the result of stacking a Hadamard matrix H on top of —H 
and changing the symbols to 0’s and 1’s, we have an example with 
m=2nend.d=1/2.) 

(1) Count the number of ordered triples (i, 7, &) such that 7 and 
j are (indices of) distinct rows and k is a column where M(i,k) # 
M(j,k) in two ways—one yielding an inequality involving d and 
the other involving the column-sums of M. Prove that if 2d > n, 
then 


Z 2d 
= a 


(This is known as Plotkin’s bound in coding theory.) What condi- 
tions ensure equality? 


18. Hadamard matrices, Reed—Muller codes 213 


(2) Suppose d = n/2. Prove that m < 2n and show that equality 
implies the existence of a Hadamard matrix of order n. 


Problem 18H. Let H be a Hadamard matrix of order m and 
let C be a symmetric conference matrix of order n. Let P be the 


O- 


matrix of size m with the form ( . where the submatrices have 


size m/2. Prove that (HW ®C) + (PH ® I) is a Hadamard matrix 
of size mn. 


Problem 18]. Let g be a prime power = 1 (mod 4). Let C be the 
matrix of (18.5). Choose H and P as in Problem 18H. Prove that 


(H®C8Q)4+ (PH QC @1,) + (A @ Iqy1 ® Jy) 


is a Hadamard matrix of size mq(q + 1). 


Notes. 

J. Hadamard (1865-1963) was a leading mathematician around 
the turn of the century. His most important work was in the theory 
of analytic functions and in mathematical physics. He is best known 
for the proof of the so-called prime number theorem, jointly with 
C. J. De La Vallée-Poussin. 

R. E. A. C. Paley (1907-1933) died in an avalanche while skiing 
at the age of 26. In his short life he produced 26 papers of excellent 
quality (mostly on Fourier Theory). 

A long outstanding conjecture of H. J. Ryser asserts that no 
Hadamard matrix of order n > 4 can be a circulant. 

J.S. Wallis (aka J. Seberry) (1976) proved that for any integer s, 
Hadamard matrices of orders 2's exist whenever t > 2log,(s — 3). 
However, it is still not known whether the set of orders of Hadamard 
matrices has ‘positive density’. 

For more on error-correcting codes, see Chapter 20. 

The codes that are now called Reed-Muller codes were (surpris- 
ingly?) indeed first treated by D. E. Muller (1954) and I. S. Reed 
(1954). 

For a detailed account of the coding and decoding for the Mariner 
missions, see E. C. Posner (1968). 
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Designs 


In this chapter we give an introduction to a large and important 
area of combinatorial theory which is known as design theory. The 
most general object that is studied in this theory is a so-called 
incidence structure. This is a triple S = (P, B,1), where: 


(1) P is a set, the elements of which are called points; 

(2) B is a set, the elements of which are called blocks; 

(3) Lis an incidence relation between P and B (i.e. I C P x B). 
The elements of I are called flags. 


If (p, B) € I, then we say that point p and block B are incident. 
We allow two different blocks B,; and By to be incident with the 
same subset of points of P. In this case one speaks of ‘repeated 
blocks’. If this does not happen, then the design is called a simple 
design and we can then consider blocks as subsets of P. In fact, 
from now on we shall always do that, taking care to realize that 
different blocks are possibly the same subset of P. This allows us 
to replace the notation (p,B) € I by p € B, and we shall often say 
that point p is ‘in block B’ instead of incident with B. 

It has become customary to denote the cardinality of P by v and 
the cardinality of B by b. So the incidence structure then is a set 
of v points and a collection of b not necessarily distinct subsets of 
the point set. The structure obtained by replacing each block by 
its complement is, of course called the complement of the structure. 
(This means that we replace I by its complement in P x B.) 

To obtain an interesting theory, we must impose some regularity 
conditions on the structure S. As a first example, we mention 
incidence structures that have the confusing name ‘linear spaces’. 
Here the blocks are usually called lines and the regularity conditions 
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are that every line contains (i.e. is incident with) at least two points 
and any two points are on exactly one line. Example 19.6 below 
shows a simple but important linear space. The following theorem 
is due to De Bruijn and Erdés (1948). The elegant proof is due to 
Conway. 


Theorem 19.1. For a linear space we have b = 1 or b > v, and 
equality implies that for any two lines there is exactly one point 
incident with both. 


PROOF: For x € P, denote by r, the number of lines incident with 
x, and similarly for B € B, let kg be the number of points on B. 
Let there be more than one line. If  ¢ L then r, > kz because 
there are ky, lines ‘joining’ x to the points on L. Suppose b < v. 
Then b(v — kz) > v(b—- ie and hence 


eV eg} 


xEP Tae” LeB eri 


and this implies that in all the inequalities, equality must hold. 
Therefore v = b, and r, = ky if x ¢ L. 

A trivial example of equality in Theorem 19.1 is a so-called near 
pencil, a structure with one line that contains all the points but 
one, and all pairs containing that point as lines of size two. Much 
more interesting examples are the projective planes that we shall 
define later in this chapter. These are the only possibilities. See 
Problem 23C. The reader can prove this fact for linear spaces as an 
exercise. 

In the rest of this chapter, we shall be interested in highly regular 
incidence structures called ‘t-designs’. Let v,k,t and A be integers 
with v >k >t >OandA> 1. A tdesign on v points with 
block-size k and index X is an incidence structure D = (P,B,I) 
with: 

(i) [P| =», 
(ii) |B| =& for all BEB, 
(iii) for any set T of t points, there are exactly \ blocks incident 
with all points in 7. 


So all blocks have the same size and every t-subset of the point set 
is contained in the same number of blocks. Two different notations 
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for such a design are widely used, namely t-(v,k,) design and 
S)(t,k,v). We shall use both of them. A Steiner system S(t, k, v) 
is a t-design with \ = 1, and we suppress the index in the notation. 
Most of the early theory of designs originated in statistics, where 2- 
designs are used in the design of experiments for statistical analysis. 
These designs are often called balanced incomplete block designs 
(BIBDs). Usually trivial designs are excluded from the theory: a 
design with one block that contains all the points or a design that 
has all the k-subsets of the point set as blocks is of course a t-design 
for t <k, but is not very interesting. 
We give a few examples; more will follow further on. 


Example 19.1. Let the nonzero vectors of F} be the points. As 
blocks we take all triples {x,y,z} with x + y+z = 0. Any pair 
x,y with x # y uniquely determines a third element z, different 
from both, satisfying this equation. So we have constructed an 
$(2,3,15). The blocks are the 2-dimensional subspaces of F5 with 
0 deleted. 

We construct a second design by taking all the vectors as point 
set and defining blocks to be 4-tuples {w,x,y,z} for which w + 
x+y+z=0. This defines an $(3,4,16). Note that if we take the 
blocks that contain O and delete this vector, we find the blocks of 
the previous design. 


Example 19.2. We take the ten edges of a Ks as point set. Each 
of the three kinds of 4-tuples shown in Fig. 19.1 will be a block. 


A 


Figure 19.1 


There are 5+10+15 = 30 blocks. No triple (of edges) is contained 
in more than one block. Therefore the blocks contain 120 different 
triples, i.e. all the triples. We have constructed an S(3, 4, 10). 
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Example 19.3. Let H be a normalized Hadamard matrix of order 
4k. Delete the first row and the first column. We now identify 
points with rows of this matrix. Each column defines a subset of 
the rows, namely those rows for which there is a + in that column. 
These subsets are the blocks. From the argument in Theorem 18.1, 
we see that any pair of points is contained in exactly k—1 blocks and 
clearly all blocks have size 2k—1. We have a 2-(4k—1,2k—1,k-1) 
design and such a design is called a Hadamard 2-design. 

Consider the same matrix H but now delete only the first row. 
Each of the other rows determines two 2k-subsets of the set of 
columns. This partition is unaffected if we change the sign of the 
row. The argument of Theorem 18.1 now shows that for any three 
columns, there are exactly k — 1 of the subsets that have three 
elements in these columns. So these 2k-sets are the blocks of a 
3-(4k, 2k, k — 1) design called a Hadamard 3-design. 


Example 19.4. Consider a regular Hadamard matrix of order 4u? 
(see Example 18.2). If we replace + by 1 and — by 0, we find 
a (0,1)-matrix with 2u? + u ones in every row and column, and 
furthermore, any two rows or columns have inner product u? + uw. 
Let the columns be the characteristic functions of the blocks of a 
design on 4u? points. The properties of the matrix show that this 
is a 2-(4u”, 2u? + u,u? + u) design. One usually prefers considering 
the complement of this design, i.e. a 2-(4u, 2u? — u, u2 — u) design. 


Problem 19A. Here are two more examples in the spirit of Ex- 
ample 19.2. 

(i) Take the edges of Kg as points of an incidence structure. The 
blocks are to be all sets of three edges that either are the edges of 
a perfect matching, or the edges of a triangle. Show that this is an 
S(2,3,15) and show that it is isomorphic to the design in Example 
19.1. 

(ii) Take the edges of K7 as points of an incidence structure. 
The blocks are to be all sets of five edges of these three types: (a) 
‘claws’ with five edges incident with a common vertex, (b) edge sets 
of pentagon subgraphs, and (c) five edges that form a triangle and 
two disjoint edges. Show that this is an $3(3, 5,21). 


We now give two elementary theorems on t-designs. 
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Theorem 19.2. The number of blocks of an S)(t,k,v) is 


(19.1) b= (7) / @ 


PROOF: Count in two ways the number of pairs (T, B), where T is 
a t-subset of P and B is a block incident with all points of T.. We 
find \(°) = b({). 


Theorem 19.3. Giveni, 0 <i <t, the number of blocks incident 
with all the points of an i-subset I of P is 


(19.2) w= a(t (Ft). 
t—2 t—2 
That is, every S)(t,k,v) is also an i-design for i < t. 


PRooF: Count in two ways the number of pairs (T,B), where T’ 
is a t-subset of P that contains J and B is a block that is incident 
with all the points of 7. 


Corollary. IfD is a t-design with point set P and block set B and 
if I is a subset of P with |I| < t, then the point set P\I and the 
blocks {B\I : I C B} form an $)(t —i,k —i,v —7i). This design is 
called the derived design Dy. 


In Example 19.1 we already saw an example of a derived design. 
If we take I = {0}, the derived design for S(3, 4,16) is $(2,3,15). 


Problem 19B. Show that if an S(3,6,v) exists, then v = 2 or 6 
(mod 20). 


The number of blocks incident with any point, i.e. bj, is usually 
denoted by r (replication number). Two special cases of Theorem 


19.3 are the following relations for the parameters of a 2-design: 


(19.3) bk = vr, 


(19.4) Mv —1)=r(k—1). 
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Theorem 19.4. Let 0 < j < t. The number of blocks of an 
S\(t,k,v) that are incident with none of the points of a j-subset J 
of P is 


(19.5) val’ \G a): 


PROOF: For x € P, let B, be the set of blocks incident with x. We 
use inclusion-exclusion, Theorem 10.1. We find that 


The result follows by substitution of (19.2) and then using (10.5). 

It is quicker to observe that b’ apparently does not depend on 
the particular set J and then count in two ways the pairs (J, B), 
where J is a 7-subset of P and JN B=. So (“) b) = oC): Then 
the result follows from Theorem 19.2. 


Corollary. Ifi+ 7 <t, then the number of blocks of an S)(t, k,v) 
that are incident with all of a set of 1 points and none of a disjoint 
set of 7 points is a constant 


(19.6) bi =d Cea) 


D vt 

(2) 

PROOF: The result follows upon application of Theorem 19.4 to 
the (t — 7)-design D;, where I is the set of i points. 


Corollary. If J is a j-subset of P, j < t, then the point set P\J 
and the blocks B with BN J = form an S,,(t — j,k,v — 7) called 
the residual design D’. 


Problem 19C. (i) Prove that an $)(t,k,v) with v < k+t is trivial. 
(ii) Prove that the complement of an $)(t,k,v) withu >k+tisa 
t-design and determine its parameters. 


Example 19.5. Consider a Hadamard 3-design 3-(4k, 2k, k — 1) 
and form the residual with respect to a set with one point. We find 
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a Hadamard 2-design 2-(4k — 1,2k,k), i.e. the complement of the 
design of Example 19.3. 


An obvious necessary condition for the existence of an S(t, k,v) 
is that the numbers b; of (19.2) are integers. However, this condition 
is not sufficient. An $(10,16, 72) does not exist, as is demonstrated 
by the following theorem due to Tits (1964). 


Theorem 19.5. In any nontrivial Steiner system S(t, k,v), 
v>(t+1)(kK-t4+1). 


PROOF: In a Steiner system, any two distinct blocks have at most 
t—1 points in common. Choose a set S of t+1 points not contained 
in any block. For each set T C S with |T| = t, there is a unique 
block Br containing 7’. Each such Br is incident with k — t points 
not in S, and any point not in S is incident with at most one 
such block Br since two such blocks already have t — 1 points of 
S in common. This shows that the union of all blocks Br contains 
(t+ 1)+ (t+ 1)(k-—t) points and the result follows. 


Given an incidence structure with |P| = v and |B| = 6b, the 
incidence matrix N is the v x 6 matrix with rows indexed by the 
elements p of ?, columns indexed by the elements B of B, and 
with the entry N(p, B) = 1 if p is incident with B, N(p,B) = 0 
otherwise. Note that the entry in row p and column q of NN! 
is the sum of N(p,B)N(q,B) over all blocks B, and this is the 
number of blocks that contain both p and q. Dually, the entry in 
row A and column B of N'N is the cardinality of AN B. 

Two designs D and D’ with incidence matrices N and N’ are 
called isomorphic or equivalent if there are permutation matrices P 
and Q such that N’ = PNQ. 

We shall often identify N with the design, i.e. we refer to the 
columns as blocks instead of as characteristic functions of blocks. 

Now if N is the incidence matrix of a 2-design, then NN' has 
the entry r everywhere on the diagonal and entries X in all other 
positions, i.e. 


(19.7) NN! =(r=A)I+A4, 


where I and J are v X v matrices. 
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Problem 19D. Let N be an 11 by 11 (0,1)-matrix with the fol- 
lowing properties: (i) every row of N has six ones; (ii) the inner 
product of any two distinct rows of N is at most 3. Show that N 
is the incidence matrix of a 2-(11,6,3) design. Furthermore show 
that this design is unique (up to isomorphism). 


The following theorem is known as Fisher’s inequality. 


Theorem 19.6. For a 2-(v,k,A) design with b blocks and v > k 
we have 
b> v. 


PROOF: Since v > k, we have r > X by (19.4). Since J has one 
eigenvalue v and its other eigenvalues are 0, the matrix on the right- 
hand side of (19.7) has v — 1 eigenvalues (r — A) and one eigenvalue 
(r—\) + Av = rk. So it has determinant rk(r — \)’"! 4 0 and 
hence N has rank v. This implies that b > v. 


From the argument in the preceding proof, we can make a very 
important conclusion, given in the next theorem. 


Theorem 19.7. If a 2-(v,k,) design has b = v blocks and v is 
even, then k — X must be a square. 


PROOF: Since b = v, we have r=k. Now N is av xX v matrix and 
by (19.7) 
(det N)? = k?(k— A)’. 


Since det N is an integer, we are done. 
Theorem 19.6 was generalized by A. Ya. Petrenjuk (1968) to 

b> (3) for any $)(4,k,v) with v > k +2 and finally generalized to 

arbitrary t-designs by Ray-Chaudhuri and Wilson (1975). 


Theorem 19.8. For an S)(t,k,v) with t > 2s andv >k+5s, we 
have b > (ye 


PROOF: We introduce the higher incidence matrices of the t-design 
D = S\(t,k,v). For i = 0,1,2,..., let N; denote the (°) x 6 ma 
trix with rows indexed by the 7-element subsets of points, columns 
indexed by the blocks, and with entry 1 in row Y and column B 
if Y C B, 0 otherwise. For 0 < 2 < j < v, we use Wj; to denote 
the i-th incidence matrix of the incidence structure whose blocks 
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are all the j-element subsets of a v-set. Thus W;; is a ) x (‘) 
matrix. 
We claim that 


NN. = >_ 0), ;Wis Wie. 
i=0 


To see this, note that N,N,’ has rows indexed by s-element subsets 
FE and columns indexed by s-element subsets F’ of the points, and 
for given E and F, the entry in row EF and column F of N,N.) is 
the number of blocks that contain both & and F’. This number is 
bos, Where w := |B F|. The entry in row E and column F of 
WW; is the number of 7-subsets of the points contained in both 
E and F, i. (#). So the (E, F)-entry on the right-hand side of 
the equation is }7;_, b},_;(“), and from (19.6) it follows that this is 
Boga 

The (°) x (°) matrices b},_;,W,;| Wj, are all positive semidefinite, 
and b3W,|W,, = b%I is positive definite since b§ > 0 (v > k+ 8). 
Therefore N,N,’ is positive definite and hence nonsingular. The 
rank of N.NJ is equal to the rank of N,, i.e. N, has rank Cs and 


this cannot exceed the number of columns of N,, which is b. 


If equality holds in the Wilson—Petrenjuk inequality, Theorem 
19.8, then the 2s-design is called tight. The only known examples 
with s > 1 and v > k +s are the unique Steiner system $(4, 7, 23) 
that we treat in the next chapter, and its complement. 

It is useful to give some idea of the history of t-designs. Only 
finitely many Steiner systems S(t,k,v) with t > 4 are known. The 
most famous are the designs $(5, 8,24) and $(5,6,12) found by E. 
Witt (1938) and the derived 4-designs. These will appear in the 
next chapter. R. H. F. Denniston (1976) constructed S(5, 6, 24), 
S(5, 7,28), S(5,6,48), and S(5,6,84). W. H. Mills (1978) con- 
structed an $(5,6,72). Again, the derived designs are Steiner 
systems. M. J. Granell and T. S. Griggs (1994) constructed an 
5(5,6, 108). Since then, no others have been found. In 1972, W. O. 
Alltop constructed the first infinite sequence of 5-designs without 
repeated blocks. We remark that it is easy to show that t-designs 
with repeated blocks exist for any t, but for a long time many de- 
sign theorists believed that nontrivial t-designs without repeated 
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blocks did not exist for t > 6. The first simple 6-design was found 
by D. W. Leavitt and S. S. Magliveras in 1982, and in 1986 D. L. 
Kreher and S. P. Radziszowski found the smallest possible simple 
6-design, an 54(6,7,14). The big sensation in this area was the pa- 
per by L. Teirlinck (1987) proving that nontrivial simple t-designs 
exist for all t. His construction produces designs with tremendously 
large parameters and hence the construction of small examples is 
still an open problem. For a number of special parameter sets, it 
has been shown that the corresponding designs do not exist. 

For the remainder of this chapter we shall mainly be interested 
in 2-designs. When t = 2, we often omit this parameter in the 
‘t-(v,k, X)’ notation and speak of (v, k, )-designs. 

A class of designs of special interest are the 2-designs with b = v. 
In this case the incidence matrix N of the design is a square matrix 
and these designs should be called square designs. However, the 
confusing name symmetric designs is standard terminology. (Note 
that N is not necessarily symmetric.) For a symmetric 2-(v, k, X) 
design (19.4) becomes 


A(v —1) = k(k-1). 


Some authors use the name projective design, a name derived from 
the fact that a 2-(v,k,1) design with b = v is called a projective 
plane (see Example 19.7). Despite the fact that we are not happy 
with the name, we shall use the terminology symmetric designs for 
these designs. 


Problem 19E. Let D be a 2-(v,k, X) design with 6 blocks and r 
blocks through every point. Let B be any block. Show that the 
number of blocks that meet B is at least 


k(r —1)*/[(k —1)(4—1) + @ -1)). 
Show that equality holds if and only if any block not disjoint from 
B meets it in a constant number of points. 


Example 19.6. Take as points the elements of Z7 and as blocks 
all triples B, := {x,x+1,x2+ 3} with x € Z;. It is easy to check 
that this yields an S(2, 3,7). The following Fig. 19.2 is often drawn. 
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The lines repesent blocks, but one block must be represented by the 
circle. 


Figure 19.2 


This design is known as the Fano plane. The idea of the construc- 
tion will be extended in Chapter 27. It is based on the fact that the 
six differences among the elements of {0,1,3} are exactly all the 
nonzero elements of Z7. If we wish to find the block containing say 
{1,6}, we observe that 6 — 1 = 1 — 3 and we therefore take x = 5 
and find «+ 1=6,2+3=1, i.e. the pair is indeed in B;. The 
reader should have no difficulty finding an S(2,4,13) in the same 
way, using Z,3. 


A symmetric design with \ = 1 is called a projective plane. If k is 
the size of the blocks, then n = k —1 is called the order of the plane 
(why this is done will become clear in Example 19.7). Expressed 
in n, the parameters of a projective plane of order n are: 


v=n4+n41, k=n+1, ale 


The blocks are usually called lines. The Fano plane is the (unique) 
projective plane of order 2. 


Example 19.7. Consider the vector space FS. This vector space 


contains (q® — 1)/(q—1) = q?+q+1 1-dimensional subspaces and 
the same number of 2-dimensional subspaces. We now construct an 
incidence structure (P, 6,1), where P and B are these two classes 
of subspaces of Fe. If a 1-dimensional subspace is contained in a 
2-dimensional subspace, we say they are incident. It is immediately 
clear that we have thus defined a projective plane of order q, i.e. 
a 2-(q¢?7+q+1,q¢+1,1) design. This design is usually denoted 
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by PG(2,q) or PG2(q), which stands for projective geometry of 
dimension 2 and order q. 

The construction defined above can also be applied if we re- 
place F, by R. We then obtain the classical real projective plane, 
where points are the 1-dimensional subspaces and lines are the 
2-dimensional subspaces. This geometry contrasts with classical 
affine geometry in the fact that no two lines are parallel. When 
speaking about the designs defined above, we use terminology from 
geometry. 


Problem 19F. Find a subset S = {s1,...,$5} of Zo, such that 
the elements of Zg; as points and the 21 blocks S +a (a € Zy1) 
form a projective plane of order 4. (Hint: there is a solution S for 
which 25 = S.) 


Problem 19G. Let (R,C,5;L) be a Latin square of order 6. De- 
fine P := Rx C. Let B be the set of blocks 


Bi = 
{(,y)ERxC:x=tiory=j or L(x,y) =L,7)} \ {ah 


for i,j) ERxC. 


(1) Show that this defines a 2-(36,15,6) design. 
(2) Show that a regular Hadamard matrix of order 36 exists. 


Problem 19H. Let D be a 3-(v,k,X) design. Suppose that the 
derived design of D with respect to a point p (i.e. the case i = 1 in 
the Corollary to Theorem 19.3) is a symmetric design. 
(1) Show that A(v — 2) = (k — 1)(k — 2). 
(2) Show that any two blocks of D meet in 0 or A+ 1 points. 
(3) Show that the set of points not on a block B together with 
the blocks disjoint from B form a 2-design D?. 
(4) Apply Fisher’s inequality to the design D? and deduce that 
v = 2k or otherwise k = (A+1)(A+2) or k = 2(A+1)(A4+2). 


What are the possibilities for the design D? Do we know any 
designs with these properties? 


Problem 19I. Let O be a subset of the points of a projective 
plane of order n such that no three points of O are on one line. 
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Show that |O| < +1 ifn is odd and that |O| <n+2 if n is even. 
A set of n + 1 points, no three on a line, is called an oval; a set of 
n+ 2 points, no three on a line, is a hyperoval. Two constructions 
of PG2(4) were given in Example 19.7 and Problem 19F. In each 
case, construct a hyperoval. 


Problem 19J. Let O be a hyperoval (with q+2 points) in PG2(q), 
q = 2". Any of the q? —1 points p ¢ O has the property that there 
are exactly $(q +2) secants of O through p. Take five points on O 
and split them into 


{{pi,p2}, {p3, pa}, {ps} } - 


This can be done in 15 ways. The two pairs determine two secants 
that meet in a point p ¢ O. The line through p and ps; meets O in 
a point, that we call pg. This defines 15 (not necessarily distinct) 
6-tuples of points on O, containing the given five points. This 
defines an $)(5,6,q + 2), a construction due to D. Jungnickel and 
S. A. Vanstone (1987). Show that the points with ‘coordinates’ (as 
in Example 19.7) (1,¢,t7), t € Fy, with (0,1,0) and (0,0,1) form a 
hyperoval O in PG»(q). Show that a suitable choice of p; to ps 
on O with the construction explained above yields repeated blocks, 
i.e. the 5-design is not simple. 


Any 2-(n?,n,1) design is called an affine plane. The points and 
lines of the plane (= 2-dimensional vector space) oe form an affine 
plane of order q. For such a design we use the notation AG»(n) 
(2-dimensional affine geometry of order 7). 


Example 19.8. Let D be a projective plane of order n. If we 
delete one line and all the points on that line, we find an affine 
plane of order n. 


Problem 19K. Let D be any affine plane of order n. If By and 
By are two blocks, then we write B,; ~ By if the two blocks are 
the same or if they have no points in common. Show that ~ is 
an equivalence relation. A class of this relation is called a parallel 
class. Show that there exists a projective plane of order n such that 
D can be obtained from that plane by the construction of Example 
19.8. 
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We shall now show that if N is the incidence matrix of a symmet- 
ric design D, then N' is also the incidence matrix of a symmetric 
design D', called the dual of D. 


Theorem 19.9. Let N be the incidence matrix of a symmetric 2- 
(v,k,\) design. Then N' is also the incidence matrix of a design. 


PROOF: Consider any block B of the design. For 0 <i < k let a; 
be the number of blocks (4 B) that have i points in common with 
B. Then counting blocks, pairs (p, B’) with p € BN B’ and triples 
(p,q, B') with p # q and {p,q} C BN B’ we find: 


Ya a Y= Hh (k—1), ¥ (sJa=(So-v, 


i=0 i=0 


from which we find S~*_,(i — \)?a; = 0. Hence, any block B’ # B 
has points in common with B, i.e. N'N = (k—A)I+ AJ. 


Note that in Example 19.7, we did not need to specify whether 
the set P was the 1-dimensional subspaces or the 2-dimensional 
subspaces. In the latter situation, we have the dual of the former. 

In many cases the designs D and D! are not isomorphic. 

Let D be a symmetric 2-(v,k,) design. There are two other 
ways to obtain a design from D. These two designs are called the 
derived design and residual design of D. This could be somewhat 
confusing since we have already introduced that terminology. We 
shall always indicate which of the two we mean. Take any block B 
of D. The residual of D with respect to B has P\B as point set and 
as blocks all B’\B with B’ # B. It is a 2-(v —k,k — A, ) design. 
The derived design has B as point set and as blocks all B’N B with 
B's B. It is a 2-(k,A,\ — 1) design. If a design with parameters 
v,k,b,r, is the residual of a symmetric design, then r = k + A. 
Any 2-design for which this equation holds is called a quasiresidual 
design. If such a design is not the residual of a symmetric design, 
then we say that it is nonembeddable. The assertion of Problem 
19K is that every affine plane is embeddable in a projective plane. 
A theorem due to W. S. Connor (1952), that we shall leave until 
Chapter 21, states that every quasiresidual design with A = 2 is 
embeddable. 


19. Designs 229 


010 
Example 19.9. Let C := (: 01) and let E; denote a 3 by 3 
100 
matrix with ones in column 7 and zeros elsewhere. Then 
BE, I Tf I 
N:=|E I C C 
By I.-C? oC 


is the 9 by 12 incidence matrix of AG2(3). Define 


1100 
1100 
111111000000 
1100 
111000111000 
111000000111 peat 
A:= Ne ees Ve) 
000111111000 
1010 
000111000111 
1001 
000000111111 
1001 
1001 


Form the 24 by 16 matrix 


A O 
D:= | N B 
IND, fi. 


One easily checks that D' is the 16 by 24 incidence matrix of a 2- 
(16,6,3) design. This is a quasiresidual design. However, it cannot 
be the residual of a 2-(25,9,3) symmetric design because the inner 
product of row i+6 and rowi+15 of D, 1 <i <9, equals 4 and, 
by Theorem 19.9, the inner product of the columns of the incidence 
matrix of a 2-(25,9,3) design is 3. This shows that nonembeddable 
designs with A = 3 exist. 


The combination of a counting argument and a suitable quadratic 
form that we used to prove Theorem 19.9 is widely used in combina- 
torics. However, sometimes it is easier to use algebraic methods as 
we shall demonstrate in the following theorem, due to Ryser. (The 
reader can try to prove the theorem by using counting arguments. ) 
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Theorem 19.10. Let D = (P, B,1I) be an incidence structure with 
|P| = |B| = v, block-size k, such that any two blocks meet in 
points. Then D is a symmetric 2-design. 


PRooF: Let N be the incidence matrix of D. Then 


(19.8) N'N=(k—A)I+AJ, 
and 
(19.9) JIN = kJ. 


By Theorem 19.9, we are done if we can show that NJ = kJ. From 
(19.8) we see that N is nonsingular and hence (19.9) can be read 
as J =kJN7!. From (19.8) we find JN'N = (k-— A+ Av)J and 
therefore 
JN' =(k—-A+Av)IN7| = (k-A+Av)k", 

i.e. N has constant row-sums. Then these row-sums must be k. 
This proves the theorem and yields (k — A+ Av)k~! = k as was to 
be expected from (19.3) and (19.4). 

As a preparation for the best known nonexistence theorem for 
designs, we need two results, both due to Lagrange. For the first, 
consider the matrix H of (18.6) with n = 1, ie. A; = (a;). Define 
y = (y1, ¥2, 43, y4) by y := xH, where x = (21, %2,%3,24). Then 
from (18.7) we find 


(19.10) (af +a5+a3+a4)(ai+25+23+2%) = (yitysty3t+yi)- 


Using this identity, Lagrange proved that every integer is the sum 
of four squares. Clearly the identity shows that it is sufficient to 
prove this for primes. For an elegant proof that a prime is the sum 
of four squares we refer to Chandrasekharan (1968). 

The following nonexistence theorem is known as the Bruck— 
Ryser—Chowla theorem. 


Theorem 19.11. Ifv,k, are integers such that \(v—1) = k(k— 
1), then for the existence of a symmetric 2-(v,k,) design it is 
necessary that: 
(i) if vu is even then k — X is a square; 
(ii) if v is odd, then the equation z? = (k— A)a? +(—1)-Y/Py? 
has a solution in integers x, y, z, not all zero. 
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PRooF: Assertion (i) was proved in Theorem 19.7. So assume that 
v is odd. Let D be a symmetric 2-(v,k,) design with incidence 
matrix N = (njij) and write n := k — X. We now introduce v linear 
forms [; in the variables x1,...,2, by 


Then the equation N'N = (k—A)I + AJ implies that 
(19.11) Lo +---+ E2 = nla? +--- +07) + May +--+ +2). 


By Lagrange’s theorem, n can be written as n = a?+---+a}. This 
and (19.10) allow us to take four of the variables x; and write 


(19.12) nla} tat.) + ao + 22,5) = (YF + i + Yee + Us), 


where each y; is a linear form in the four variables x;,..., ri+3. 

We now first assume that v = 1 (mod 4). By applying this to 
(19.11), four variables at a time, and introducing w for x1+---+2y, 
we reduce (19.11) to 


(19.13) Lite tBPayite tye, tna t+ dw’. 


Since H in (18.6) is invertible, we can express the variables x; for 
1 <j <v-—1 as linear forms in the corresponding y; and hence 
w is a linear form in these variables and x,. Next, we reduce the 
number of variables in the following way. If the linear form Ly, 
expressed in yj,.--,Yy—1,%v, does not have coefficient +1 for yy, 
then we set Li = y1, and if the coefficient is +1, we set Ly = —y1, 
and in both cases, we subsequently solve this equation for y; as 
a linear expression in the remaining variables y; and x,. This is 
substituted in the expression w. So (19.11) has been reduced to 


Ete + Ly = yh te ty t+ nay t+ Aw’. 


We proceed in this way for yo,...,Yy—1. In each step, w is replaced 
by another linear form in the remaining variables, and hence we 
end up with 

LT? = na? + dw’, 


is 
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in which both LZ, and w are rational multiples of the variable z,. 
If we multiply this by the common denominator of the factors, we 
find an equation 

2? = (k— Aja? + dy? 


in integers. This proves the assertion if v = 1 (mod 4). If v = 
3 (mod 4), the same procedure is applied to (19.13) after adding 
nx?,, to both sides, where z,41 is a new variable. The equation is 
then finally reduced to nx?,, = y2,; + Aw” and again we multiply 
by a common denominator to find an equation of type 


(k— Aja? = 2? + Ay? 


in accordance with assertion (ii). 


Example 19.10. From Example 19.7, we know that a projective 
plane of order n exists for 2 < n < 9, except possibly for n = 6. 
By Theorem 19.11, a necessary condition for the existence of a 
projective plane of order 6 is that the equation z? = 6x? — y” has a 
nontrivial solution. If such a solution exists, then also one for which 
x, y, and z have no prime factor in common, i.e. z and y are both 
odd. Then z? and y? are both = 1 (mod 8). Since 6x? (mod 8) is 
either 0 or 6, we see that the equation has only the trivial solution 
(0,0,0). Therefore a projective plane of order 6 does not exist. 

If we try the same thing for a plane of order 10, we find the 
equation z7 = 10x? — y’?, which has the solution x = 1, y = 1, 
z = 3. In this case Theorem 19.11 tells us nothing. In 1989, Lam 
et al. announced that a computer search involving several hundred 
hours on a Cray 1 had excluded the existence of a projective plane 
of order 10. This is the only case where the nonexistence of a 
symmetric 2-design has been shown using something other than 
Theorem 19.11. 


Corollary. If there exists a projective plane of order n = 1 or 2 
(mod 4), then n is the sum of two integral squares. 


PRroorF: The condition n = 1 or 2 (mod 4) implies that v = n? + 
n+1=3 (mod 4). Theorem 19.11 asserts that n is the sum of two 
rational squares. It is well known that n is the sum of two rational 
squares if and only if n is the sum of two integral squares. (This 
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follows from the condition that n is the sum of two integral squares 
if and only if no prime divisor of the square-free part of n is = 3 
(mod 4).) 


Problem 19L. Show that a symmetric 2-(29,8,2) design does not 
exist. 


Problem 19M. Suppose / is a rational square matrix of order v 
and that MM' = mI. Show that if v is odd, then m is a square. 
Show that if v = 2 (mod 4), then m is the sum of two rational 
squares. 

(Note that one consequence of this latter result is that the exis- 
tence of a conference matrix of order n = 2 (mod 4) implies that 
n—1 is the sum of two squares.) 


A great deal of work has been done on the construction of 2- 
designs. We shall only treat a number of examples that will give 
some idea of the kind of methods that have been used. The smallest 
nontrivial pair (k,A) to consider is (3,1). A 2-(v,3,1) design is 
called a Steiner triple system. One uses the notation ST'S(v) for 
such a design. By (19.3) and (19.4), a necessary condition for the 
existence of such a design is that v = 1 (mod 6) or v = 3 (mod 6). 
We shall show that this condition is also sufficient. This will be done 
by direct construction in Examples 19.11 and 19.15. However, it is 
useful to see a number of examples of a more complicated approach. 
The methods that we demonstrate can be used for the construction 
of other designs than Steiner triple systems. Furthermore, they can 
be used to produce designs with certain subdesigns (see Problem 
19N) or prescribed automorphism group. The idea of this approach 
is to find direct constructions for small examples and some recursive 
constructions, and subsequently show that, for any v that satisfies 
the necessary conditions, an ST'S(v) can be constructed by the 
recursive methods, using the list of known small examples. We 
shall see below that this in fact reduces to a (not very difficult) 
problem in number theory. As stated above, we restrict ourselves 
to a number of examples. The reader may wish to try to show that 
our examples suffice to find an ST\S(v) for all possible values of v, 
without using Examples 19.11 and 19.15. 

We consider the trivial design with only one block of size 3 
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as STS(3). We have already seen constructions of ST'S(7) = 
PG (2) and ST'S(9) = AG2(3). In Example 19.1, we constructed 
an STS(15). 


Example 19.11. Let n = 2¢+1. We define P := Z, x Z3. As 
blocks we take all triples {(z,0), (2,1), (a,2)} with x € Z, and all 
triples {(x, i), (y,2), ($(a@ + y),i+ 1)} with « #y in Z, andi € Zs. 
This simple construction provides an STS(6t + 3) for every t. 


Example 19.12. Let q = 6t +1 be a prime power and let a be 
a primitive element in F,, i.e. F; is a cyclic group generated by a. 
We define 


(19.14) Bye := {a'+é, a+, at +6}, O<i<t, €€R,. 


We claim that the elements of F, as points and the blocks B;¢ form 
an ST'S(q). The idea of the proof is the same as in Example 19.6. 
Note that a® = 1, a = —1 and define s by a® = (a — 1). We 
consider the six differences of pairs from Bo. These are: 


at one es a, —(a* _ 1) = eae 
> LE 

qt es qt = 92 oe —(a*# _ a?) — q? oe. 
a _ att _ en, =(1 = ae) = att, 


It follows that for any 7 4 0 in F,, there is a unique i, 0 <7 <t, 
such that 7 occurs as the difference of two elements of B;9. Hence 
for any x and y in F,, there is a unique 7 and a unique € € F, such 
that the pair x,y occurs in the block Bj. 


The method of Examples 19.6 and 19.12 is known as the method 
of differences. Example 19.15 will show a more complicated use of 
the same idea. 

We now know that an STS(v) exists for v = 13,19, 25, 31,37, 43 
and 49 as well as the values mentioned above. This includes all 
v = 1 (mod 6) less than 50. In fact, we now know at least one 
STS(v) for each feasible value of v less than 100, except v = 55, 
v=85,v=91. 
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Example 19.13. Let there be an ST’S(v;) on the point set V; 
(i = 1,2). We take V; x V2 as a new point set and define as blocks 
all triples: {(x1, 1), (2, y2), (v3, y3)} for which 

(1) a, =x = 23 and {y1, yo, y3} is a block of ST'S(vy); 

(2) {x1, 22,23} is a block of ST'S(v,) and y = yo = 3; 

(3) {21,%2, 23} is a block of ST'S(v,) and {y1, y2, y3} is a block 

of ST'S(v). 

It is practically obvious that this defines an ST'S(v,v2). The reader 
should check that we have defined the correct number of blocks. 


This construction provides us with an ST'S(91). 


Example 19.14. We show a slightly more complicated construc- 
tion. Suppose that we have an STS(v,) on the point set Vi = 
{1,2,...,vi} with block set 5), and furthermore suppose that the 
blocks that are completely contained in V = {s+1,...,v1}, where 
s = vy, —v, form an STS(v). Let S_ be the set of triples of an 
STS(v2) on the point set Vo = {1,2,..., ve}. 

We consider a new point set 


P=V Oeil s< es, Tag xX 


This set has v + vo(v, — v) points. We introduce a set B of four 
kinds of blocks: 


(1) those of the subsystem STS(v); 

(2) {(a,y), (b,y),c} with cE V, {a,b,c} € S; and y € Va; 

(3) {(a,y), (0, y), (c, y)} with {a, b,c} a block in S; with no point 
in V, and y € V3; 

(4) {(21, 41), (2, y2), (U3, ys) f, Where {y1, ye, ys} is a block in S5 
and the integers 71, £2, #3 satisfy 


1 +22+23=0 (mod s). 


Again, one easily checks that any two points of P uniquely deter- 
mine a block in B. Hence P and B are the points and blocks of a 
Steiner triple system on v + v2(v; — v) points. A simple example 
is obtained by letting the subsystem be just one block, i.e. v = 3. 
Taking v, = 7, vo = 13, we find an ST'S(55). 


We have thus constructed an ST'S(v) for every feasible value of 
v less than 100, except v = 85. 
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Problem 19N. (a) Show that if an ST$(v,) and an ST'S(v2) both 
exist, then there is an STS (vjv2 — v2 +1). Use this construction to 
find an ST'S(85). 

(b) Construct an S7T'S(15) on the set {0,1,...,14} such that it 
contains a Fano plane on {0,1,...,6} as a subsystem. 


Example 19.15. Consider as point set Za; x Z3 U {oo}. Addition 
of elements is coordinatewise with the extra convention oo+(x,7) = 
oo. For notational convenience we sometimes write the second co- 
ordinate as an index, i.e. x; instead of (x,7). We now define four 
types of ‘base blocks’: 


(1) {0o, 01, 02}; 

(2) {00,00, ti}, {00,01,t2}, {00, 02, to}; 

(3) {0o, 1, (—2)i}, {01, 12, (—t)o}, {02, to, (—t)o}, +=1,...,t-]; 

(4) {to, “1, (1—4) 1}, {ti, 12, (1—z)o}, {to, 10, (1—a)o}, i= 1, tele 
We have 6¢ + 1 base blocks. For a = 0,1,...,¢t— 1 we add the 
element (a,0) (i.e. a9) to each of the elements of every base block, 
thus producing t(6t+1) blocks. We claim that these are the triples 
of an ST'S(6t+ 1). It is trivial that the base blocks of type 2 yield a 
set of blocks in which every pair of points, one of which is oo, occurs 
exactly once. The cyclic nature of the definition of the base blocks 
shows that it is sufficient for us to check that all pairs {ao, bo} with 
a # b and all pairs {ao,b} occur in the triples we have defined. If 
a < b and b—a = 2s, then the pair {ao, bo} occurs in the triple 
obtained from {09, 59, (—s)o} ‘translated’ by the element (b— s,0). 
Similarly, if b—a is odd, we find the required pair by translating a 
base block of type 4. Now consider a pair {ao,b;}. Ifa =b<t-—1, 
we find the pair by translating the base block of type 1 by (a,0). 
If a #6 and a < t, we have to look for the pair in a translate of a 
base block of type 2 or of type 3. We must search for a base block 
in which the difference b—a occurs as y— «x for two elements yj, Xo. 
For type 2, this difference is t and in the blocks {0p, i1, (—i)1} we 
find the differences 7, 1 <i <<t—1, and —i = 2t-1,1<i<t—-l, 
indeed every difference once! Now, the rest of the details can be 
left as an exercise. 

This example shows that if v = 6¢+ 1, then an STS(v) exists. 
Combined with Example 19.11 we have a construction for every 
feasible value of v. 
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We end this chapter with an amusing application of the Fano 
plane. At present the idea is not used in practice but the problem 
itself has a practical origin, and maybe some day generalizations of 
the following method will be used. Suppose one wishes to store one 
of the integers 1 to 7 in a so-called ‘write-once memory’. This is a 
binary memory, originally filled with zeros, for which it is possible 
to change certain bits to ones but not back again, i.e. the state 1 is 
permanent. This happens in practice with paper tape, into which 
holes are punched, or compact discs, where a laser creates pits in 
certain positions. In both cases, we cannot erase what was written 
in the memory. To store the integers 1 to 7, we need a memory of 
three bits. What if one wishes to use the memory four consecutive 
times? The simplest solution is to have a 12-bit memory that is 
partitioned into four 3-bit sections, one for each consecutive usage. 
We assume that the memory is very expensive and we would like 
to be able to use a shorter memory for the same purpose. We shall 
now show that seven bits suffice, a saving of more than 40%. 


Figure 19.3 


Let P = {1,2,...,7} be the set of points of PG2(2) and let 
£ denote the set of lines. To store one of the integers 1 to 7 ina 
memory with positions numbered 1 to 7, we use the following rules. 
As a general rule: if we wish to store i and the memory is in a state 
corresponding to i (from a previous usage), then we do nothing. 
Otherwise the rules are: 

(1) if the memory is empty, store i by putting a 1 in position i; 

(2) to store 7 when the memory is in state 7, put a 1 in position 

k, where {i,j,k} € L£; 

(3) to store i when the memory contains two 1’s, not correspond- 

ing to 2, put in two more 1’s, such that 7 is one of the four 
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1’s and the other three form a line in £. No matter what 
the two original 1’s were, this is possible (sometimes in two 
ways); 

(4) if the memory contains four 1’s, we may assume that we are 
in the situation of Fig. 19.3. To store 3, we do nothing (by 
the general rule); to store one of the missing numbers, we 
put 1’s in the other two positions; to store 1, 2, or 4, storea 1 
in the empty position on the line through 3 and the number 
we wish to store. 


We leave it as an exercise for the reader to formulate the rules 
for reading the memory. Note that the memory uniquely reads 
the integer presently stored in the memory but it cannot see how 
often an integer has been stored or what was stored on the previous 
usage. 


Problem 190. (i) Suppose A is a family of subsets of an n-set 
X, so that any member of A has odd cardinality, but such that 
any two distinct members of A meet in an even number of points. 
Show that |A| <n. 

(ii) Suppose members of A have even cardinality but any two 
intersect in an odd number of points. Show that |A] <n-+1. Can 
you find examples where equality holds? 


Problem 19P. Consider 2-(v, k, \ = 2) designs where v = Ce 
(i) Find an example with k = 3. (You might try five points on a 
circle and one in the middle.) 
(ii) Let Ai, Ao,..., A», be the blocks and for i = 2,3,...,6, let 
i; = |A;N A,|. Compute 


b 


b b 
domi Dp wi(te— 1), D(H — Ye = 2) 


i=2 
in terms of k. What can you say about the u;’s? 


Problem 19Q. A Generalized Steiner System is an incidence struc- 
ture with a point set P of size v and a set B of subsets of P, again 
called blocks, where |6| = b, such that every t-subset of P is in a 
unique block. We do not require the blocks to have constant size 
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but we rule out the trivial case where b = 1. Denote by b;,, the min- 
imal value of b for a nontrivial Generalized Steiner System. Prove 


that for t > 2 
bev (ben —_ 1) 2 (7) : 


Note that for t = 2, this result is the same as Theorem 19.1. Much 
stronger bounds are known but not as easy to derive. 


Problem 19R. We shall say that a point in the Fano plane is a 
representative of a line if it is incident with that line. How many 
SDR’s does the Fano plane have? 


Problem 198. Construct 3-(2*, 4,1) designs, for all k > 2. 


Problem 19T. We say that a design can be extended if it is a 
derived design of a design, called the extended design. Prove that 
if a symmetric design can be extended twice, it must be a 2-(21, 5, 1) 
design. 


Problem 19U. Consider the graph G of Problem 1J. Define an 
incidence structure I with as points the vertices of G and as lines 
the sets I(x), for vertices x € G. Show that the properties of G 
imply that I is a projective plane. (Cf. Problem 21Q.) 


Problem 19V. We use the hypothesis of, and the notation intro- 
duced in the proof of, Theorem 19.8. 

(i) Check that W;,N, = (1) Nj. 

(ii) Show that the product of any two of W,'W;,, i=0,1,...,98, 
is a linear combination of these s + 1 matrices. So the linear span 
A of these matrices is closed under multiplication. 

(iii) Suppose equality holds in Theorem 19.8 so that the matrix 
N, is square. Then N,) M~'N, =I where M := )>)_) 0),_;W,1 Wis. 
Since M € A, part (ii) implies M~!' € A. Use this to prove that 
there is a polynomial f(x) of degree s so that f(|AM Bl) = 0 for 
all distinct blocks A,B. (Hence there are at most s ‘intersection 


numbers’.) This generalizes Theorem 19.9. 


Notes. 
The first occurrence of a 2-design may be AG(3) in a paper by 
Pliicker (1839). One usually attributes the introduction of Steiner 
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systems to Woolhouse (1844); of course not to Steiner! Quite often 
they are said to originate with a problem of T. P. Kirkman (1847). 
T. P. Kirkman (1806-1895), a self-educated man, was a minister of 
the Church of England. He was an amateur mathematician with 
many contributions to the subject. Probably the best known is his 
15 schoolgirls problem. The problem is to arrange 15 schoolgirls 
in parties of three for seven days’ walks such that every two of 
them walk together exactly once. This amounts to constructing an 
STS(15) for which the set of triples can be partitioned into seven 
‘parallel classes’. 

Jakob Steiner (1796-1863) was an important geometer of his 
time. He became interested in what we now call Steiner systems in 
1853 when he studied the configuration of 28 double tangents of a 
plane quartic curve. 

Sir Ronald A. Fisher (1890-1962) is considered to be one of the 
most prominent statisticians. Besides important contributions to 
statistics (multivariate analysis) and genetics, he is known for his 
work on the application of statistical theory to agriculture and the 
design of experiments. The applications to the design of agricul- 
tural experiments account for our usage of v for the number of 
points of a design (varieties) and r for the number of blocks through 
a point (replication number). 

Theorem 19.7 is due to M. P. Schutzenberger (1949). 

R. A. Fisher was not the only statistician to contribute to the 
mathematical theory of designs. In fact, we should probably con- 
sider the Indian mathematician R. C. Bose (1901-1987) to be the 
most important one. Many of the construction methods described 
in this chapter (such as the method of differences) are due to him. 

G. Fano (1871-1952), whose name has become attached to the 
plane PG»2(2), was important in the Italian school of projective 
geometry. 

Projective planes over finite fields were first studied by K. G. C. 
von Staudt (1798-1867) in his book Geometrie der Lage (1856). 

The first example of a nonembeddable quasiresidual design was 
given by Bhattacharya (1944). It was also a 2-(16,6,3) design. How- 
ever, Example 19.9 is a much simpler example. 

J. L. Lagrange (1736-1813) was born and educated in Italy but 
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he is considered a French mathematician (who studied in Berlin). 
Besides many important contributions to analysis, he is known for 
several theorems in number theory. 

The Bruck—Ryser—Chowla theorem, Theorem 19.11, is so well 
known that it is often referred to as just BRC. 

The idea of Example 19.15 is due to Skolem (1958). His method 
was actually slightly different. It has led to the term Skolem se- 
quences. These have other applications, e.g. in radioastronomy. 
Here is the idea. Partition the set {1,2,...,2n} into pairs {a;, b;} 
such that 6; — a; = 71, 1 <i<n. This is a Skolem sequence. For 
example, {9,10}, {2,4}, {5,8}, {3,7}, {1, 6} is such a partition for 
n = 5. Now form the triples {0, a;-+n, b; +n} and consider these as 
base blocks (mod 6n+1). Since all the differences 1,2,...,3n and 
their negatives occur exactly once, the blocks form an ST'S(6n+1). 

For an interesting application of the Golay code (see Chapter 20) 
to write-once memories, see Cohen et al. (1986). They show the 
possibility of three successive writings of 11 bits on 23 positions. 
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20 


Codes and designs 


We introduce some more terminology from the theory of error- 
correcting codes. In the most general sense, a code of length n 
is simply a subset C C S$”, where S is a finite set (the alphabet). 
Elements of C' are called codewords. A binary code is one with 
alphabet S = {0,1}; a ternary code is one with S = {0,1,2}. 
The distance d(x, y) between two words (vectors) x and y in $” is 
defined to be the number of positions in which they differ, i.e. 


(20.1) Ux y) = {tl ots ny ae F Gh |, 


This is indeed a distance function in the usual sense; check that it 
satisfies the triangle inequality. 

The concept of distance has led to the usage of geometric termi- 
nology, e.g. the set B,(x) := {y € Fj : d(x,y) < r} is called the 
sphere with radius r and center x, though actually the name ‘ball’ 
would be better. 


The minimum distance d of the code C is 


(20.2) d:=min{d(x,y):xEC,yeC,xFy}. 


Much of coding theory is concerned with linear codes. By a q- 
ary |[n, k] code, we mean a linear subspace C' of dimension k of the 
vector space F”. 


The weight w(x) of x is defined by 


(20.3) w(x) := d(x, 0). 


This can be defined whenever 0 is one of the symbols (elements of 
the alphabet), but is especially meaningful for linear codes. When 
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C is linear, the distance between codewords x and y is equal to the 
weight of x — y, which is another codeword, and so the minimum 
distance of C' is equal to the minimum weight, i.e. the minimum of 
the weights of nonzero codewords. We use the notation |n, k, d] code 
for an [n,k] code with minimum distance at least d. If d = 2e +1, 
then C’ is called an e-error-correcting code. 

The covering radius p(C) of the code C is defined to be the 
minimal R such that the spheres with radius R and codewords as 
centers cover S”, i.e. 


(20.4) p(C) := max{min{d(x,c):cEC}:xeES"}. 


In Chapter 18, we mentioned the repetition code in {0,1}”, i.e. the 
1-dimensional subspace of F5’ containing only 0 and 1. Ifnm = 2e+1 
then every word has distance < e to exactly one codeword. So this 
code has covering radius e. The two spheres of radius e around the 
two codewords are disjoint and they cover the space. 

In general, we call a not necessarily linear code C C S” an (e- 
error-correcting) perfect code when |C| > 1 and when every x € 
S” has distance < e to exactly one codeword. This is equivalent 
to C having minimum distance d = 2e + 1 and covering radius 
e. Clearly, perfect codes are combinatorially interesting objects. 
However, they are extremely rare. 


Theorem 20.1. If C is a code in S” with distance d > 2e + 1, 
then 


(20.5) Cl ¥ (") (q-1)'<¢". 


Proor: The sum on the left-hand side of (20.5) counts the number 
of words in a sphere of radius e. 


The bound given in this theorem is known as the sphere packing 
bound or as the Hamming bound. If equality holds, then the code 
is perfect. 


Problem 20A. Show that if a [23,12,7] binary code exists, then 
this code is perfect. 
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Problem 20B. By (20.5), a binary code of length 6 and minimum 
distance 3 has at most 9 codewords. Show that equality cannot 
hold. (However, 8 is possible.) 


Two codes are called equivalent if one is obtained from the other 
by some permutation of the coordinate positions in S”. Sometimes 
this definition is extended by also allowing a permutation of the 
elements of S such as interchanging +1 and —1 when S = Fs. 

A k x n matrix G is called a generator matriz of the |n, k] code C 
if Cis spanned by the rows of G. Elementary linear algebra shows 
that C' is equivalent to a code with a generator matrix G = (I; P), 
where P issome k x n—k matrix. This is called the reduced echelon 
form for a generator. 

The dual C+ of C is defined by 


(20.6) C™ := {x € F) : Veec(x, c) = 0}. 


If H is a generator matrix for C+, then clearly 


(20.7) GOs {xe Hh: xo” =O}: 


FH is called a parity check matrix for the code C. If G = (I, P) isa 
generator matrix, then H = (—P!' I,_;) is a parity check matrix. 
If C = C+, then C is called a self-dual code. If C C C+, then C is 
called self-orthogonal. 

If C is a linear code in F’, then the extended code C is defined 
by 
(20.8) 


C — { (Cire SiGe Cre) : feigencs ey) E C, Cl +... + €n41 = O}. 


The symbol ¢,+1 is called the parity check symbol. 


Example 20.1. Let n = (q* — 1)/(q—1). Consider a matrix 
H of size k x n, with entries in F,, for which the columns are 
pairwise linearly independent. Note that this is the maximal value 
of n for which this is possible. Then H is clearly the parity check 
matrix of an [n,n — k] code with minimum distance 3. Such a 
code is called a q-ary Hamming code. If c is a codeword, then 
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|Bi(c)| =1+n(q—1) =¢". Since |C| = q"*, we see that this code 
is perfect (by (20.5)). 


Problem 20C. Let H be the ternary [4,2] Hamming code. Define 
a (nonlinear) ternary code C of length 9 with codewords 


(Lopeiy st. gas Yiecss GA) 


by requiring that oy x; ~ 0 and that (y. — %1,..-,y4— 4) isa 
codeword in H. Show that C has covering radius 1. (No ternary 
code of length 9 with covering radius 1 and fewer than 2 - 3° code- 
words is known.) 


Example 20.2. Consider the binary Hamming code C’ of length 
n= 2*—1. By definition, the dual code C+ has as generator matrix 
the k x nm matrix that has all possible nonzero vectors of length k 
as columns. Therefore, the extension of C+ is the code R’(1,k) of 
Chapter 18. C+ is usually called the simplex code of length n. We 
remark that the [8,4] extended binary Hamming code is self-dual. 


Problem 20D. Show that the covering radius of the code R(1, 2k) 
is 27-1 _ 9-1. (Hint: use the +1 representation instead of (0,1) 
and work over Q to show that the covering radius is at most this 
large. Consider the word z that is the characteristic function of 


{x © BRP : gyro +--+ + cop-102% = 1} 


to show equality.) 


We mention one easily proved bound for arbitrary codes, known 
as the Singleton bound. 


Theorem 20.2. Let C’ be any code of length n and minimum 
distance d over F,. Then 


|C| < qe. 


PROOF: From each codeword, we delete the last d — 1 symbols. 
The set of ‘shortened’ words consists of words that are pairwise 
different! There can be no more than qg”~%*! of them. 
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Example 20.3. We give an example that shows a nice connection 
with Chapter 19. Let g = 2%. Let S be a hyperoval in PG»(q). 
Using the terminology of Example 19.7, the elements of S are vec- 
tors in 3 with the property that no three of them are linearly 
dependent. We take the elements of S as the columns of a 3 by 
q+2 matrix H. If we interpret H as the parity check matrix of 
a [q+ 2,q — 1] code over F,, then this code has minimum distance 
at least 4. So by Theorem 20.2, the distance is equal to 4. This 
is one of the rare examples of equality in Theorem 20.2. Codes for 
which equality holds in the Singleton bound are called maximum 
distance separable codes (MDS codes), another unfortunate name! 
There are many open problems concerning these codes. 


If C is a q-ary [n,k] code, and if A; denotes the number of code- 
words in C’ with weight 7, then 


(20.9) A(z) := So Az! 
i=0 


is called the weight enumerator of C. Of course, Ag = 1 and A(1) = 
IC] = a. 

Problem 20E. Suppose C is a binary code, not necessarily linear, 
with length 23, minimum distance 7, and |C| = 2'*. Assume 0 € C. 
First show that C is a perfect code. Let the weight enumerator of 
C be given by (20.9). Count pairs (x,c) with e € C, w(x) = 4, and 
d(x,c) = 3, and show that A7 = 253. Then show that the weight 
enumerator of C' is in fact completely determined by the fact that 
C is a perfect code and 0€ C. 


The following theorem is one of the most useful in the theory of 
error-correcting codes. It is due to F. J. MacWilliams (1963). 


Theorem 20.3. Let C be an [n,k] code over F, with weight enu- 
merator A(z) and let B(z) be the weight enumerator of C+. Then 


li=sZ 
20.10 B(z) =q* (1+ (¢—1)z)° A (| —_——— .}. 
20.10) Be) =e "+ @-1)2"A(- 
PROOF: We only give the proof for the case q = 2. For other 
values of q, the proof is essentially the same (instead of (—1)'*Y) 
used below, one must use x((u,v)), where y is a character on F; ). 
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Define 
g(u) = So (-1)er2e). 
veky 
Then 
Solu) =D Weer = 2 eye. 
uceC ucC vek;’ veky’ uceC 


Here, if v € C+, the inner sum is |C|. If v ¢ C+, then half of the 
terms in the inner sum have value +1, the other half —1. Therefore 


(20.11) S— g(u) = |C|- B(z). 


uceC 


Now 


The result follows by substituting this in (20.11). 


Corollary. If we write B(z) = )0;_) Bjz’, then for q = 2, we find 
from (20.10): 


(20.12) By =o" ascw() Ce) 


These relations, known as the Mac Williams relations, are linearly 
independent equations for the coefficients A;, given the coefficients 
By 

Many of the known nontrivial 5-designs were found by the follow- 
ing elegant application of MacWilliams’ theorem, usually referred 
to as the Assmus—Mattson theorem (1969). Again we restrict the 
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proof to the binary case. For other qg, the theorem has an obvious 
generalization with nearly the same proof. We identify the posi- 
tions of a code of length n with the set P := {1,2,...,n}. This 
allows us to interpret a codeword in a binary code as a subset of 
P (i.e. as the characteristic function of a subset). The support of a 
codeword is the set of coordinate positions where the codeword is 
not zero. 


Problem 20F. Let C be a perfect binary e-error-correcting code 
of length n. Assume 0 is a symbol and that 0 is a codeword. Show 
that P together with the supports of codewords of weight d = 2e+1 
is an S(e+1,2e+1,n). 


Theorem 20.4. Let A be a binary |n,k,d]| code and let B := A+ 
be the dual code, an [n,n — k] code. Let t < d. Suppose the 
number of nonzero weights in B, that are less than or equal to 
n—t, is << d—t. Then for each weight w, the supports of the words 
of weight w in A form a t-design and the supports of the words of 
weight w in B form a t-design. 


ProorF: If C is any code and a t-subset T of P has been fixed, then 
we denote by C’ the code of length n — t obtained by deleting the 
coordinates in T’ from the codewords in C’. We denote by Co the 
subcode of C’ obtained by deleting the coordinates in T from the 
codewords in C' that have zeros at all positions of T’. 

The proof is in four steps. 


(i) Let T be a subset of P of size t. Since t is less than the 
minimum distance of A, the code A’ has as many codewords as A, 
i.e. has dimension k also. In fact, A’ still has minimum distance 
>d-—t. So the dual (A’)+ has dimension n — k — t. Clearly, Bo is 
a subcode of (A’)+. Since the dimension of Bo is at least n —k —t, 
we must have Bo = (A’)*. 


(ii) Let S> a;z’ and S> G;z' be the weight enumerators for A’ and 
Bo, respectively. We claim that these weight enumerators do not 
depend on the particular t-subset 7’, but only on the numbers f¢, n, 
k, and the weights of words in B. 

Let 0 < f) < lg < +--+ < ¢, <n—t, where r < d—t, be the nonzero 
weights < n—t for the code B. These are the only possible weights 
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for Bo. Then (20.12) gives 
n—t " d £:\ (n-t-£; 
nore() aera") 
Boley = ("5 ) $dom SS, 


By hypothesis, the minimum distance of A’ is > r, so we know the 
values of a; for j <r, namely ag = 1, a3 = +--+: = a,_1 = 0. Thus 
we have r linear equations in r unknowns (;,. These unknowns 
are uniquely determined if the r x r coefficient matrix M that has 
(i, j)-entry p;(¢;) where 


pj (a) = y-1" (*) eas) 


m=0 


l<i<r,0<j<r-1, is nonsingular. But p;(x) is a poly- 
nomial in x of exact degree j (the coefficient of x is (—1)/2//j!), 
so elementary column operations reduce M to the Vandermonde 
matrix 


1D). @: tee 
Ls ie DR pct 
1 0, @ grt 


which is nonsingular since the @;’s are distinct. 

Thus the weight enumerator 5~G;z' does not depend on the 
choice of the subset T. Since A’ is the dual of Bo, S> a;z' is also 
independent of T’. 


(iii) Let € be the collection of words of weight w in B, interpreted 
as subsets of P. The number of members of € that miss all coor- 
dinates in T is the number of words of weight w in Bo, and this is 
independent of T. That is, the complement of (P,€) is a t-design. 
By Problem 19C, the sets in € also form the blocks of a t-design. 
This proves the second assertion of the theorem. (Remark: This 
might be criticized if w > n —t. But in this case our argument 
when applied to t’ := n —w shows that either every w-subset is the 
support of a codeword, or no w-subset is the support of a codeword; 
so the words of weight w, if any, form a trivial t-design.) 
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(iv) To prove the first assertion of the theorem, we proceed by 
induction. We start with w = d. Let D be the collection of words 
of weight d in A. The number of sets in D that contain a given 
t-subset T of P is equal to the number of words of weight d—t in A’, 
and as we saw above, this number does not depend on the choice 
of T. So D is a t-design. Let w > d and suppose the assertion is 
true for all w’ with w > w’ > d. Now let D denote the collection 
of words of weight w in A. In this case the number of subsets in 
D that contain a given t-subset 7’, is equal to the number of words 
of weight w—t in A’ corresponding to codewords of weight w in 
A. By (iii), the total number of words of weight w —t in A’ does 
not depend on the choice of JT. By the induction hypothesis and 
(19.6), the number of words of weight w—t in A’ corresponding to 
codewords of weight less than w in A does not depend either on T. 
This proves the assertion. 


Problem 20G. What is the weight enumerator of the dual of the 
binary Hamming code of length 2" — 1? Derive an expression for 
the weight enumerator of the Hamming code itself. 


Example 20.4. Let A be the extended [8,4] binary Hamming 
code. We know that A = At. In the notation of Theorem 20.4, 
we have d = 4. Take t = 3. The conditions of Theorem 20.4 are 
satisfied. Hence the words of weight 4 form a 3-design, which is of 
course the Hadamard 3-(8,4,1) design corresponding to R(1, 3). 


Example 20.5. After the next example, we shall treat the famous 
binary Golay code and show that the corresponding extended code 
Go, is a self-dual [24, 12, 8] code with weights 0,8, 12,16 and 24. So 
Theorem 20.4 shows that the words of weight 8 in this code form 
a 5-design. From Problem 20E, we know that there are 759 blocks 
and then (19.1) shows that A = 1. This also follows from the fact 
that the code has distance 8, so two blocks have at most four points 
in common. This design is the Witt design $(5,8, 24) mentioned 
in Chapter 19. The words of other weights also yield 5-designs. 


Example 20.6. The following 5-designs, and several others, were 
found by V. Pless (1972). Consider the 18 by 18 Paley matrix C 
given by (18.5). We consider a ternary [36,18] code Symz3¢ with 
generator G = (lig C). Such a code is called a symmetry code. 
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Since C is a Paley matrix, we have GG! = O, i.e. Symsgg is a self- 
dual code. This implies that all weights in the code are divisible 
by 3. We claim that all the words in Sym3¢ have weight at least 
12. Observe that since C is symmetric, the matrix (—C Ig) is a 
parity check matrix for the code, and, because the code is self-dual, 
this means that that matrix is also a generator matrix for the code. 
If (a,b), where a and b are vectors in Fj’, is a codeword, then 
(—b, a) is also a codeword. This shows that if there is a codeword 
with weight less than 12, there is such a codeword that is a linear 
combination of at most four rows of G. These are easily checked by 
hand as follows; the reader should do it as an exercise. The fact that 
Cis a Paley matrix and the argument used in the proof of Theorem 
18.1 (and in Problem 18B) show that a linear combination of 1, 2, 
or 3 rows of G has weight 18, respectively 12, respectively 12 or 15. 
It remains to check combinations of four rows. Now use the fact 
that Q in (18.5) is cyclic, which implies that only a few essentially 
different combinations have to be examined. One can also extend 
the array used in the proof of Theorem 18.1 by one row. Both 
methods involve very little work and produce the result that the 
minimum weight is 12. (Originally this was done by computer.) 


We now use the generalization of Theorem 20.4 to ternary codes. 
If one considers the words of some fixed weight in Sym3¢ (the gen- 
eralization holds for weights 12,15,18, and 21), and if we replace 
each word by the set of positions where the nonzero coordinates 
occur, we find 5-designs. Since the codewords c and 2c yield the 
same set, we only consider this set as one block. 


We now come to the most famous of all binary codes: the binary 
Golay code Go3. There are very many constructions of this code, 
some of them quite elegant and with short proofs of its properties. 
We show only one of these constructions related to design theory. 

We consider the incidence matrix N of the (unique) 2-(11,6,3) 
design; see Problem 19D. We have NN' = 31 +3J. Consider N 
as a matrix with entries in Fy. Then NN' =I+ J. So N has rank 
10 and the only vector x with xN = 0 is 1. The design properties 
imply trivially that any row has weight 6 and that the sum of two 
rows of N has weight 6. We also know that the sum of three or 
four rows of N is not 0. 
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Next, let G be the 12 by 24 matrix over F, given by G = (Ij2 P), 
where 


os 


(20.13) P=|. 
1 


Every row of G has a weight = 0 (mod 4). Any two rows of G have 
inner product 0. This implies that the weight of any linear com- 
bination of the rows of G is = 0 (mod 4); prove this by induction. 
The observations made about N then show that a linear combina- 
tion of any number of rows of G has weight at least 8. Consider 
the code generated by G and call it Go4. Delete any coordinate to 
find a binary [23,12] code with minimum distance at least 7. By 
Problem 20A, this code must have minimum distance equal to 7 
and, furthermore, it is a perfect code! We denote it by G3. These 
two notations will be justified below, where we show that Go4, the 
extended binary Golay code, is unique, from which the uniqueness 
of Go3 can be shown. 


Theorem 20.5. If C is a binary code of length 24, with |C| = 2", 
minimum distance 8, and if 0 € C, then C is equivalent to G24. 


Proor: (i) The difficult part of the proof is to show that C' must 
be a linear code. To see this, observe that deleting any coordinate 
produces a code C’ of length 23 and distance 7 with |C’| = 2. 
By Problem 20E, the weight enumerator of this perfect code is de- 
termined: Ag = Ao3 = 1, Az = Aig = 253, Ag = Ais = 506, 
Ay, = Ay = 1288. From the fact that this is the case, no mat- 
ter which of the 24 positions is deleted from C, it follows that 
all codewords in C’ have weight 0,8,12,16, or 24. Furthermore a 
change of origin (i.e. adding a fixed codeword to all the codewords) 
shows that we can also infer that the distance of any two code- 
words is 0,8,12,16, or 24. Since all weights and all distances are 
= 0 (mod 4), any two codewords have inner product 0. Therefore 
the words of C' span a code that is self-orthogonal. However, such 
a code can have at most 2'* words. Therefore, C itself must be a 
linear and self-dual code. 
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(ii) We form a generator matrix of C' by taking as first row any 
word of weight 12. So after a permutation of positions, we have: 


ES hc Ae 29 ae 0 
G=( A B if 


We know that any linear combination of the rows of B must have 
even weight 4 0, so B has rank 11. Therefore the code generated 
by B is the [12, 11, 2] even-weight code. We conclude that we may 
assume that B is the matrix [;, bordered by a column of 1’s. 
Another permutation of columns yields a generator matrix G" of 
the form (liz P), where P has the same form as in (20.13). What 
do we know about the matrix N? Clearly any row of N must have 
weight 6. Furthermore, the sum of any two rows of N must have 
weight at least 6. By Problem 19D, N is the incidence matrix of 
the unique 2-(11,6,3) design, i.e. C' is equivalent to G4. 


As we saw in Example 20.5, the words of weight 8 in Go, form the 
blocks of the Witt design D = S(5,8,24). Denote by {0,1,...,23} 
the point set of D and consider J = {21, 22,23}. By the Corollary 
to Theorem 19.3, D; is an $(2, 5,21), i.e. a projective plane of order 
4 (which is also known to be unique). The following problem shows 
the beautiful combinatorial structure of the design D. 


Problem 20H. Let B bea block of D with |BNJ| = a and define 
B* := B\I. We saw that if a = 3, then B* is a line in PG2(4). 
Show that 

(i) a = 2 implies that B* is a hyperoval in PG2(4); 

(ii) ~w = 0 implies that B* is the symmetric difference of two lines. 

(The ambitious reader may wish to show that if a = 1, then 
the seven points of B* and the lines of PG2(4) containing at least 
two such points, form a Fano plane. Such a plane is called a Baer 
subplane of PG2(4).) 

By counting the hyperovals, pairs of lines, etc., one can show that 
each of the geometric configurations mentioned above is one of the 
sets B*. In fact, one of the well known constructions of $(5, 8,24) 
starts with these objects and produces the design by appending 
suitable subsets of J. If one does not use the automorphism group 
of PG2(4) in the argument, then this construction is a nontrivial 
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combinatorial problem. (Again: the ambitious reader should try 
it.) 
Problem 201. Let F,; = {0,1,w,@}. Let C be the [6,3] code over 
F, with codewords (a, b,c, f(1), f(w), f(@)), where f(x) := ax? + 
bx + ¢. 

(i) Show that C has minimum weight 4 and no words of weight 
5. 

(ii) Let G be the binary code with as codewords all 4 by 6 (0,1)- 
matrices A with rows ao, a), a,, a> such that: 


(1) every column of A has the same parity as its first row ag, 
(2) a, twa, +Waz EC. 
Show that G is a [24, 12,8] code, ie. G = Gay. 


Problem 20J. As in Example 20.6, we construct a ternary code 
Sym by using the Paley matrix C of order 6. Show that Sym 
is a [12,6,6] self-dual code. Puncture the code (i.e. delete some 
coordinate) to obtain a [11,6,5] ternary code Gj. Show that this 
code is perfect. It is the ternary Golay code. 


We have now given several examples of designs constructed by 
using a suitable code. We reverse the procedure and study codes 
generated by the (characteristic functions of the) blocks of a design. 
Let N be the incidence matrix of a projective plane of order n. We 
consider the subspace C of F), where v = n?+n+1, generated by 
the rows of N. If n is odd, C is not very interesting. Namely, if we 
take the sum of the rows of N that have a 1 in a fixed position, the 
result is a row with a 0 in that position and 1’s elsewhere. These 
vectors generate the [v, v — 1, 2] even-weight code and this must be 
C’, since C' obviously has no words of odd weight. If n is even, the 
problem becomes more interesting. We restrict ourselves to n = 2 
(mod 4). 


Theorem 20.6. If = 2 (mod 4), the rows of the incidence ma- 
trix N of a projective plane of order n generate a binary code C' 
with dimension $(n? +n + 2). 


PROOF: (i) Since n is even, the code C is self-orthogonal because 
every line has an odd number of points and any two lines meet in 
one point. Therefore dimC < $(n? +n + 2). 
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(ii) Let dimC = r and let k:=n?+n+1-—r=dimC?. Let 
HT be a parity check matrix for C. Assume that the coordinate 
places have been permuted in such a way that H has the form 


(I, P). Define A := (3 ey Interpret the (0,1)-matrices N and A 


as matrices over Q. Then 
det NAT = det N = (n+ 1)n2*), 


Since all entries in the first k columns of NA! are even integers, 
det N is apparently divisible by 2°. So $(n? +n) >k,ier > 
s(n? +n+2). 

The result follows from (i) and (ii). 

The theorem shows that the code C’ generated by the rows of N 
has the property that C is self-dual. We shall now show an even 
more interesting property of this code, namely that one can recover 
the plane from the code. 


Theorem 20.7. The code C of Theorem 20.6 has minimum weight 
n+1 and every codeword of minimum weight corresponds to a line 
in the plane of order n. 


ProorF: As before, we interpret a codeword as a subset of the 
plane. So it will be clear what we mean by saying that a point is 
on the codeword c. Let c be a codeword with w(c) = d. Since n is 
even, the codewords corresponding to lines in the plane have a 1 as 
parity check symbol in the extended code C. This code is self-dual 
and this implies: 

(1) if d is odd then c meets every line at least once; 

(2) if d is even then every line through a fixed point of ¢ meets 

c in a second point. 


In case (2) we immediately see that d > +1. In case (1) we find: 
(nt+tl)d>n?+n4+1,ie.d>n+1. If w(c) =n+1, then there 
is a line L of the plane that meets c in at least 3 points. If some 
point of L is not a point of c, then every line 4 L through that 
point must meet c by (1). This would imply that d>n+3. Soc 
must be the line L. 


Recall that in a projective plane of even order n, a hyperoval is 
a set of n + 2 points such that no three are on one line. 
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Theorem 20.8. The codewords of weight n+ 2 in the code C' of 
Theorem 20.6 are precisely all the hyperovals of the plane. 


ProorF: (i) Let v € C and w(v) = n+2. Every line meets v 
in an even number of points. Let L be a line of the plane and 
suppose that v and L have 2a points in common. Each of the n 
lines £ L through one of these 2a points meets v at least once 
more. Therefore 2a+n<n+2,ie.a=Oora=1. 

(ii) Let V be a hyperoval. Let S' be the set of Ce?) secants of V. 
Each point not in V is on $(n + 2) such lines; each point of V is 
on n+ 1 secants. Since n = 2 (mod 4), the sum of the codewords 
corresponding to secants is the characteristic function of V. So this 
is a codeword. 


Theorems like the previous two yielded enough information about 
the code corresponding to a projective plane of order 10 to make 
possible the computer search that we mentioned earlier. The first 
important step was due to F. J. MacWilliams, N. J. A. Sloane, 
and J. G. Thompson (1973) who showed that this code could have 
no words of weight 15. Presently, the only known projective plane 
with order n = 2 (mod 4) is the Fano plane. 


Problem 20K. Find a 4x39 matrix of 0’s, 1’s, and —1’s so that (i) 
no column is all zero, and no column is +1 times another column, 
and (ii) each row contains 13 of each of 0,1, —1. 

Generalize to construct r by $(3” —1)—1 matrices with property 
(i) and (ii) (with 13 replaced by an appropriate value). 

(The matrix above can be used for the following puzzle. Of 39 
coins, one is counterfeit and weighs either more or less than the 
others. You have access to a balance with two pans; some coins 
can be put in one pan and some in the other, and the balance will 
tell you which subset has greater total weight. You must determine 
the counterfeit coin, and whether it is lighter or heavier, using a 
total of four weighings. ) 


Problem 20L. Korner and Simonyi called a code over an alphabet 
of gq > 2 symbols trifferent if the codewords were not only different 
but for each triple of codewords there is a position where they have 
three different entries. Define F'(n,q) to be the maximal number 
of codewords in a trifferent code of length n. Prove that F'(n,q) > 
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6-1/?(q/3)"?. (Hint: Use Theorem 3.6. Consider a random code 
with N codewords. For each 3-subset S of {1,2,...,} let Ag be the 
event that the three codewords with indices in S are not all different 
or not trifferent. Show that As has probability (3q(q—1) +q)"/q", 
etc.) 


Notes. 

We have not been concerned with the error-correcting properties 
of codes in this chapter. For a systematic treatment of coding 
theory we refer to Van Lint (1999). Much more material on the 
relation between design theory and coding theory can be found in 
Cameron and Van Lint (1991). The best reference book for coding 
theory is MacWilliams and Sloane (1977). 

For some of the history of the origin of coding theory, we refer to 
Thompson (1983). There is some controversy about priority, but it 
is clear that both R. W. Hamming and M. J. E. Golay contributed 
to the ‘discovery’ of this fascinating topic in 1947 and 1948. The 
monumental paper by C. E. Shannon (1948) really started things 
going. It seems that Hamming was irritated by the fact that his 
computer kept stopping when it detected an error. He correctly 
decided that if it could detect errors, it should be able to locate 
them and correct them and then get on with the job! Golay pub- 
lished the two Golay codes by giving their generator matrices but 
without proof of the properties. 

E. F. Assmus (1931-1998) died tragically at a meeting on Codes 
and Designs in a place he loved: the Mathematisches Forschungsin- 
stitut in Oberwolfach. A large part of this chapter is based on his 
many contributions to the relations between coding theory and de- 
sign theory (often written in collaboration with H. F. Mattson). 

M. J. E. Golay (1902-1989) was a Swiss physicist who worked in 
many different fields. He is known for his work on infrared spec- 
troscopy and the invention of the capillary column but to mathe- 
maticians mainly for his discovery of the two Golay codes. 

For more about MDS codes, we refer to MacWilliams and Sloane 
(1977). 

F. J. MacWilliams (1917-1990) made many contributions to cod- 
ing theory. Of these, the theorem known by her name is the most 
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important. Her book with N. J. A. Sloane is the most important ref- 
erence book on coding theory. As is the case with many important 
coding theorists, she spent most of her career at Bell Laboratories. 

As mentioned earlier, there are many constructions of the Golay 
code G4. Each of them shows that some group of permutations 
is contained in the automorphism group of the code. For example, 
one of the most common constructions displays an automorphism 
of order 23. Since the code is unique, its full automorphism group 
must contain all these groups as subgroups. In this way, one can 
prove that this automorphism group is the famous Mathieu group 
M4. This group is of order 24 - 23 - 22 -21-20-16-3 and acts 
5-transitively on the 24 coordinate positions of the codewords. 

It has been shown that there are no other perfect e-error-correct- 
ing codes with e > 2 than the ones mentioned in this chapter. For 
more on this subject, see Van Lint (1999). Also see Chapter 30. 
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Strongly regular graphs 
and partial geometries 


A strongly regular graph srg(v,k,,) is a graph with v vertices 
that is regular of degree k and that has the following properties: 


(1) For any two adjacent vertices x,y, there are exactly X ver- 
tices adjacent to x and to y. 

(2) For any two nonadjacent vertices x,y, there are exactly 
vertices adjacent to x and to y. 


A trivial example is a pentagon, an srg(5,2,0,1). Perhaps the 
most famous example is the graph of Fig. 1.4, the Petersen graph, 
an srg(10, 3,0, 1). 

Clearly a graph that is the union of m complete graphs K;, is an 
srg(km, k—1,k—2,0). Sometimes we shall exclude trivial examples 
by requiring that a strongly regular graph and its complement are 
connected, i.e. we assume 


(21.1) Ope kak 


(We remark that the fact that ~ = 0 implies that the graph is a 
union of complete graphs is most easily seen from (21.4) below.) It 
is not difficult to see that the complement G of an srg(v,k, A, ) G 
is an 


(21.2) srg(v,u —k-—1,v—2k+p—2,v—2k +4) 


and since the parameters are nonnegative, we find a simple condi- 
tion on the parameters, namely 


(21.3) v—2kt+p—-2>0. 
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Another relation between the parameters is easily found as follows. 
Consider any vertex x and partition the other vertices into the set 
I(x) of vertices joined to x and the set A(x) of vertices not joined 
to x. By the definition of strongly regular graphs, I(x) consists of 
k vertices, each of which is joined to A vertices of [(a). Each vertex 
in A(x) is joined to p vertices in I(x). Counting edges with one 
end in ['(a) and one end in A(z) in two ways, we find 


(21.4) k(k-A-1)=p(v—k-1). 


Problem 21A. Show that a strongly regular graph is extremal 
in the following sense. Let G be a graph with v vertices, each of 
degree at most k. Suppose that any two adjacent vertices, respec- 
tively nonadjacent vertices, have at least A, respectively j4, common 
neighbors. Then 


k(k-1—2) > uv -—k-1) 


and equality implies that G is strongly regular. 


Before going into the fascinating theory of these graphs, we men- 
tion several classes of examples. 


Example 21.1. The triangular graph T(m), m > 4, has as ver- 
tices the 2-element subsets of a set of cardinality m; two distinct 
vertices are adjacent if and only if they are not disjoint. T(m) is 


an srg((‘}),2(m — 2),m— 2,4). The Petersen graph is T(5) (see 


Problem 1A). 


Example 21.2. The lattice graph L2(m), m > 2, has as vertex set 
S x S, where S is a set of cardinality m; two distinct vertices are 
adjacent if and only if they have a common coordinate. L2(m) is 
an srg(m?,2(m —1),m — 2,2). L2(2) is a quadrangle which is a 
trivial example because its complement is not connected. 


Example 21.3. Let q be a prime power with gq = 1 (mod 4). The 
Paley graph P(q) has the elements of F, as vertices; two vertices 
are adjacent if and only if their difference is a nonzero square in 
F,. That this is an srg(q,3(q — 1), 4(q — 5), <(q — 1)) is a direct 
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consequence of (18.4), but it is easier to show this using the matrix 
Q of (18.5) as we shall see below. Note that P(5) is the pentagon. 


Example 21.4. The Clebsch graph has as vertices all subsets of 
even cardinality of the set {1,...,5}; two vertices are joined if and 
only if their symmetric difference has cardinality 4. This is an 
srg(16,5,0,2). One can also describe the vertices as the words of 
even weight in F), with an edge if the distance is 4. For any vertex 
x, the induced subgraph on A(z) is the Petersen graph. See Fig. 
21.1 below. 


Figure 21.1 


Let us define the adjacency matriz A of a graph G with v vertices 
1,...,u to be the v x v (0,1)-matrix with a;; = aj; = 1 if and only 
if the vertices i and j are joined. Clearly A is symmetric with 
zeros on the diagonal. The statement that G is an srg(v,k, A, 1) is 
equivalent to 


(21.5) AJ=kJ, A?+(w-AAt(w—k)I =p. 


If Q is the matrix occurring in (18.5), then by (18.4) we have Q? = 
qi — J, and the adjacency matrix of the graph P(q) defined in 
Example 21.3 is A = 3(Q+ J—I). So A satisfies (21.5) with 
k= ng 1), A= i(q—5), b= se =): 

We have already seen one connection between strongly regular 
graphs and topics treated in earlier chapters. Several more will 


264 A Course in Combinatorics 


follow. The theory of strongly regular graphs shows a completely 
different aspect of graph theory from what we studied in Chapters 
1-4 and 8. We now rely heavily on algebraic methods. Nevertheless, 
we also wish to show some nice counting arguments that are used 
in this theory. 

We define the Bose—Mesner algebra A of an srg G to be the 3- 
dimensional algebra 2 of linear combinations of J, J, and A. That 
this is indeed an algebra is a consequence of (21.5). This algebra 
consists of symmetric commuting matrices and therefore there is 
an orthogonal matrix that simultaneously diagonalizes them. This 
can also be seen in an elementary way from (21.5). In fact, we shall 
see in the next theorem that A has three distinct eigenspaces in R” 
and each of them is an eigenspace for any element of 2. 


Theorem 21.1. If there is an srg(v,k, X, 1), then the numbers 


_1f, 1. @=DW-a)-2% 
v zi 1+ ae 


and 


maf fe-t (v—1)(u— A) — 2k 
: J(u = AP + Ak = 1) 


are nonnegative integers. 


Proor: Let A be the adjacency matrix of the graph. By (21.5) 
the all-one vector j:= (1,1,...,1)' is an eigenvector of A, with 
eigenvalue k, and of course it is also an eigenvector of J and of 
J. Application of (21.5) yields a second proof of (21.4). The mul- 
tiplicity of this eigenvalue is one because the graph is connected. 
Any other eigenvector, say with eigenvalue x, is orthogonal to j and 
therefore we find from (21.5), 


e+ (w— Aa + (u—k) = 0. 


This equation has two solutions 


(21.6) rs=5{\-nt VO—nP tae} 
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Let f and g be the multiplicities of r and s as eigenvalues of A. 
Then we have 


1+ft+g=v and tr(A)=k+fr+gs=0. 


If we solve these two linear equations, we find the assertion of the 
theorem. 


Note that the multiplicities can also be expressed as 


_ ~h(s + Ik 8) a ga ED 
(k+rs)(r—s) (k+rs)(r—s) 


(21.7) ff 


From (21.6) we can draw a further (surprising) conclusion. If 
f #4, then the square root in the denominator of the expressions 
for f and for g must be an integer, i.e. (u — A)? + 4(k — p) is a 
perfect square. It then follows from (21.6) that the eigenvalues r 
and s are integers! 

The other case, i.e. when f = g, is usually called the half-case. 
We then have an srg(4y + 1,24, 4— 1,1). The Paley graphs are 
examples of the half-case. In his paper on conference telephony, 
mentioned in Chapter 18, Belevitch observed that a necessary con- 
dition for the existence of a conference matrix of order n is that 
n —1 is the sum of two squares. See Problem 19M. We note that 
the parameters v = 21, k = 10, AX = 4, w = 5 satisfy all the neces- 
sary conditions for the existence of a strongly regular graph that we 
stated above, but the graph does not exist because, using (18.5), it 
would imply the existence of a conference matrix of order 22 and 
since 21 is not the sum of two squares, this is impossible. 

The condition of Theorem 21.1 is known as the integrality con- 
dition. We shall call a parameter set (v,k, 2,4) that satisfies these 
conditions and the earlier necessary conditions, a feasible set. 


Problem 21B. Show that if an srg(k? + 1,k,0,1) exists, then 
k =1,2,3,7 or 57. (See the notes to Chapter 4.) 


We have seen that the adjacency matrix of an srg(v,k, A, w) has 
three eigenvalues, one of which is k. There is a partial converse: 
If G is a connected regular graph of degree k with an adjacency 
matrix A with exactly three distinct eigenvalues, then G is strongly 
regular. This is Problem 31F of Chapter 31. 
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To obtain some more examples of strongly regular graphs, we 
consider another connection with an earlier topic, namely designs. 
This idea is due to J.-M. Goethals and J. J. Seidel (1970). A 2- 
design is called quasisymmetric if the cardinality of the intersection 
of two distinct blocks takes only two distinct values, say x > y. We 
introduce a graph, called the block graph of the design; the vertices 
are the blocks of the design and two vertices are adjacent if and 
only if their intersection has cardinality y. 


Theorem 21.2. The block graph of a quasisymmetric design is 
strongly regular. 


PRooF: Let N be the v x b incidence matrix of the design and 
A the adjacency matrix of its block graph G. We have (using the 
parameters v, k, b, r, of the 2-design): 


NN! =(r=—A)IE +A, 
N'N=kI+yA+2a(J—I- A). 


(The first equation is (19.7), the second is the definition of A.) 
We know that both NN' and N'N have all-one eigenvectors j (of 
different lengths!) with eigenvalue kr. Also, we know that NN‘ 
has only the eigenvalue r—A on j+, with multiplicity v—1. Therefore 
N'N has this same eigenvalue, with the same multiplicity, and the 
eigenvalue 0 with multiplicity 6—v. Since x # y, A is a linear 
combination of J, J, and N'N. Therefore A has eigenvector j and 
only two eigenvalues on the space j+. They are (r—\+k+2)/(y—2) 
with multiplicity v — 1 and (a —k)/(y— x) with multiplicity b—v. 
By our observation above, G is strongly regular. 


Now that we know that G is strongly regular, it is easy to calcu- 
late the parameters. We do not give the distasteful formulae. 


Example 21.5. Consider the S(5,8,24) of Chapter 20. We fix 
two points and consider the residual design with respect to these 
two points. This is a 3-(22,6,1). From Problem 20F we know that 
the 21 blocks of this design that contain a given point are the lines 
of PG2(A4) (i.e. after the point is removed) and the 56 blocks that 
do not contain that point are hyperovals in this plane. They form a 
2-(21,6,4) design. From the properties of $(5,8, 24) it follows that 
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any two of these hyperovals meet in 0 or in 2 points. Therefore, 
this derived design is a quasisymmetric design. From Theorem 21.2 
it follows that the block graph of the design is a (56,k, A, w). Here, 
k = 10 because there are 45 = (4—1)- e) hyperovals that meet 
a given hyperoval in 2 points. From the design properties or from 
the conditions on strongly regular graphs, one then easily finds that 
A = 0, uw = 2. The graph constructed in this example is known as 
the Gewirtz graph. Note that if A is the adjacency matrix of the 
Gewirtz graph, then by (21.5) we have (I + A)? = 91 + 2J, and 
that means that N := J+ A is the incidence matrix of a 2-(56,11,2) 
design (a so-called biplane). So in this case, we also find a new 
design from the strongly regular graph. 


Problem 21C. Let D := (P,6,1) be the 3-(22,6,1) design men- 
tioned above; it is unique. We form a graph with as vertex set 
PUBU {co}. The vertex oo is joined to every element of P. An 
element of P is joined to an element of B if they are incident. Fi- 
nally, two elements of B are joined if and only if the blocks are 
disjoint. Show that this defines an srg(100, 22,0,6). This graph is 
called the Higman-Sims graph. 


If at this point we were to make a list of feasible parameter sets 
for strongly regular graphs, then the list would contain many sets 
that do not correspond to such a graph. We shall prove a few of the 
theorems that exclude these sets. Again, the methods are algebraic. 
As preparation, the reader should try to do the following problem 
using purely combinatorial arguments. 


Problem 21D. Show that an srg(28,9,0,4) does not exist. 


That this graph does not exist is a consequence of the next the- 
orem, known as the Krein condition. 


Theorem 21.3. Let G be a strongly regular graph with adjacency 
matrix A, having eigenvalues k, r, and s. Then 


(r+ 1)(k+r42rs) < (k+r)(s +1)’, 


and 
(st1)\(kK+s4+2sr) < (k+s)(r+1)’. 
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Proor: Let B := J—I—A. We know that the matrices of 2 have 
three common eigenspaces of dimension 1, f, and g, respectively. 
Call these spaces Vo, Vi, and V2. Here Vo is the space spanned by 
j, and V; and V2 correspond to the eigenvalues r and s of A. For 
i = 0,1,2 let E; be the matrix of the projection onto Vj, i.e. E; 
has eigenvalue 1 on V; and 0 on the other two eigenspaces. These 
matrices are what is called a basis of minimal idempotents of 2. 
We now consider the same set 2, but as multiplication we take the 
Hadamard product (see Problem 21E). It is obvious that any two 
of I, A, and B have product O. Any (0,1)-matrix is idempotent for 
Hadamard multiplication. So we conclude that 2 is closed under 
Hadamard multiplication and that the matrices J, A, and B area 
basis of minimal idempotents. 
Note that the definition of EF; (i = 0,1,2) implies that 


l=E£:o+F, 4+ £o, A=kEj+rE, + sEp, 


and 
B=(v-—k—-1)£o+ (-r-DE, 4+ (-s -1)kp. 


From this, we can express the F; in J, A, and B. 
Consider the behavior of the matrices E; under the Hadamard 
product. Since they form a basis of 21, we have 


2 


k=0 


where qh is the eigenvalue of E; o KE; on V;. It is a tedious cal- 
culation, but the numbers qh can be expressed in terms of the 
parameters of G using the relations given above. 

At this point we need the result of Problem 21E. The Hadamard 
product E; o £; is a principal submatrix of the Kronecker product 
EF; ® E;. This matrix is idempotent, hence its eigenvalues are 0 and 
1. By the theorem alluded to in Problem 21E, the eigenvalues qi 
have to be between 0 and 1. It turns out (after one has done all the 
calculations) that all except two of the inequalities that one finds 
in this way are satisfied. These two are q}, > 0 and qj) > 0. These 
are the two equations of the assertion. 
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Problem 21E. Let A and B be two symmetric n x n matrices 
with eigenvalues A1,...,An, respectively W1,..., /n. Determine the 
eigenvalues of A ® B. We define the Hadamard product Ao B of 
A and B to be the matrix with entries a;;b;;. Show that Ao B is 
a principal submatrix of A ® B. What can you conclude about the 
eigenvalues of Ao B? 


To appreciate the next theorem, the reader should first convince 
himself that the parameter set (50,21,4,12) is feasible and that it 
satisfies the Krein conditions. What should one count to show 
that the set nevertheless does not correspond to a strongly regular 
graph? 


Theorem 21.4. Let k,r,s be the eigenvalues of the adjacency 
matrix A of an srg(v,k, 2,1) and let the multiplicities be 1, f and 
g. Then 


vs Sff+3) and v<59(9 +8). 


Proor: Let B be J —I—A and let the matrices E; (i = 0,1, 2) 
be as in the proof of the previous theorem. Let 


FE, =al+ BA+7B. 


Since FE, is symmetric, there is an orthogonal matrix (H, kK) such 


that a. yt 
E, = (HM, K,) € a) (cr = HH). 
1 


Here H; is anv x f matrix with HI, = I. We consider the rows 
of H, as v vectors in R/. It follows that each of these vectors has 
length a? and any two distinct vectors from this set (call it S) have 
inner product ( or y. Such a set is called a spherical 2-distance set 
because we can interpret S as a set of points on a sphere with only 
two distinct (angular) distances. We must show that the cardinality 
of S is at most 5f(f +3). 

We normalize and obtain a set S’ of v vectors on the unit sphere 
Q in R/ with only two inner products, say b and c. For every v € S’ 
we define a function fy, : Q — R by 


((v, x) — b)((v, x) — ©) 
(1 —b)(1—c) ; 


fv(x) = 


270 A Course in Combinatorics 


These functions are polynomials of degree 2 in the coordinates of 
x. If{fve S,weS,v 4w, then f\(v) = 1 and f,(w) = 0. 
Therefore these functions are linearly independent. The spaces of 
homogeneous linear and quadratic forms on 2) have dimensions f 
and $f(f +1) respectively; since aj +---+ xv = 1 on Q, we can 
express constants in forms of degree 2 and 1. From the linear 
independence of the functions fy, it follows that there can be at 
most f + 3f(f +1) = 4f(f +3) of them. 
This theorem is known as the absolute bound. It was shown by 
A. Neumaier (1980) that the absolute bound can be improved to 


1 
u< af lf +1) unless qj, = 0 


(and similarly for the other inequality). 

Before turning to the relation between strongly regular graphs 
and certain incidence structures, we show that nice counting argu- 
ments also play a role in this area. 


Theorem 21.5. Let G be a strongly regular graph with the same 
parameters as T(n), i.e. G = srg((5),2(n — 2),n— 2,4). Ifn > 8, 
then G is isomorphic to T(n). 


PROOF: Fix a vertex x and denote by [I the induced subgraph on 
T(z). This is a regular graph on 2(n — 2) vertices, with degree 
n—2. Let y and z be nonadjacent vertices of I and let there be m 
vertices in I adjacent to both. Since w = 4 and z is adjacent to y 
and z, we have m < 3. In the graph IT, there are n — 2 — m vertices 
adjacent to y but not to z, and the same number adjacent only to 
z. Hence there are m — 2 vertices adjacent to neither of them. So 
m > 2. Suppose m = 38. Consider the unique vertex w adjacent 
to neither y nor z. Every vertex adjacent to w in I is adjacent to 
y or to z, which implies that n — 2 < 3+ 3 = 6, a contradiction. 
Hence m = 2, and we also see that there are no triangles in the 
complement I’. 

We now show that I is bipartite. On the contrary, assume there 
is a circuit of odd length in this graph. Choose such a circuit 
C = (%0,%1,---,2% = Xo) with k minimal. From the argument 
above we know that k 4 3. In the graph I, the vertices x and x1 
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are nonadjacent and are both adjacent to x3, %4,...,%,_2. By the 
argument above, k < 6 and since k is odd, k = 5. The vertex x9 
has degree n — 3 in I, ice. it is adjacent to n — 5 vertices, besides 
x, and x4. These must be nonadjacent in I to both x; and x4. 
A similar assertion can be made for x2, yielding n — 5 vertices 
outside C’ nonadjacent in I to both x; and x3. There are exactly 
n — 4 vertices outside C’ nonadjacent to x1; hence at least n — 6 
vertices nonadjacent in I to both x3 and x4. The result of the first 
paragraph implies that n — 6 < 1, a contradiction. 

The result of the second paragraph means that T contains two 
disjoint cliques (sets of vertices, any two of which are adjacent) of 
size n — 2. Since x was arbitrary, we have shown that any vertex 
in G lies in two cliques of size n — 1 (cliques of this size will be 
called grand cliques). The same argument shows that any edge 
is in one of these grand cliques. The number of grand cliques is 
2(5)/(n — 1) =n. Since any two grand cliques have at most one 
vertex in common, they must have exactly one vertex in common. 
If we consider grand cliques as ‘points’ and vertices as ‘blocks’, then 
we have just shown that these points and blocks form a 2-(n, 2, 1) 
design, i.e. the trivial design of all pairs from an n-set. G is the 
block graph of this design, i.e. G is indeed isomorphic to Tn). 


We remark that the theorem can also be proved, by case analysis, 
for n < 8 and that for n = 8 there are three other graphs with the 
parameters of T(8), known as the Chang graphs. 


Problem 21F. Let G be a strongly regular graph with the same 
parameters as Lo(n), ie. G = srg(n”,2(n—1),n—2,2). Prove that 
if n > 4, then G is isomorphic to Lo(n). 


R. C. Bose (1963) studied large cliques in more general strongly 
regular graphs. This led him to the concept of a partial geometry. 
A partial geometry pg(K, R,T) is an incidence structure of points 
and lines with the following properties: 


(1) every line has K points and every point is on R lines; 

(2) any two points are incident with at most one line; 

(3) if point p is not on line L, then there are exactly T lines 
through p that meet L. 
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If two points x and y are on a line, we say that they are collinear 
and write x ~ y. 


Problem 21G. Determine the number of points and lines of a 
partial geometry pg(K, R,T). 


By interchanging the roles of points and lines in a pg(K, R,T), 
we find the so-called dual partial geometry, a pg(R, K,T). 

We introduce the point graph of a partial geometry as the graph 
with the points of the geometry as vertices, and an edge {z, y} if 
and only if x ~ y. 


Problem 21H. Show that the point graph of a pg(K,R,T) is a 
(possibly trivial) srg(v,k, A, w), with: 


(kK —1)(R—-1) 
T 


v=K (14 i k= R(K —1), 


A= (kK —2)4+(R-1)(T-1), = RT, 
r=K-1-T, s=—R. 


If a strongly regular graph has parameters such that it could be 
the point graph of some partial geometry, then the graph is called 
pseudo-geometric, and it is called geometric if it is indeed the point 
graph of a partial geometry. The idea of introducing grand cliques 
and then showing that the grand cliques and the points form a 
design, as used above, was used by Bose to prove the following 
theorem. 


Theorem 21.6. If a strongly regular graph is pseudo-geometric, 
corresponding to pg(K, R,T), and if2K > R(R—-1)+T(R+1)(R?- 
2R +2), then the graph is geometric. 


We do not give a proof here. The ideas of this proof were ex- 
tended by A. Neumaier (1979), and, after an improvement by A. 
E. Brouwer, his result obtained the following form (known as the 
claw bound). 
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Theorem 21.7. If, for a strongly regular graph with the usual 
parameters, pp # s* and up # s(s+1), then 2(r+1) < s(s+1)(w+1). 


The idea of the proof is to show that if r is large, then the graph 
is the point graph of a partial geometry; this is done by counting 
arguments. Then the absolute bound and Krein conditions are 
applied to the point graph of the dual partial geometry. These 
show the inequality stating that r cannot be too large. 

The parameters (2058, 242,91, 20) are feasible and satisfy all the 
other necessary conditions stated in this chapter, except the claw 
bound. So a graph with these parameters does not exist. 


Problem 211. Consider an srg(v,k,A,1). Show that the induced 
subgraph on I(x) is a union of cliques. Count the number of (A+2)- 
cliques in the graph. Thus show that both k/(A+1) and vk/{(A + 
1)(A + 2)} are integers. Apply to the set (209,16,3,1). 


The partial geometries can be divided into four classes: 

(1) A partial geometry with T = K is a 2-(v, K,1) design. 

(2) A partial geometry with T = R—1 is called a net; dually, if 
T = K —1 we speak of a transversal design. 

(3) A partial geometry with T = 1 is called a generalized quad- 
rangle; the notation GQ(Kk —1, R—1) is commonly used for 
a pg(K, R,1). 

(4) If 1 < T < min{kK — 1,R-— 1}, then we call the partial 
geometry proper. 


Example 21.6. Consider the affine plane AG2(n). From Problem 
19K we know that the lines can be divided into equivalence classes 
of parallel lines, each containing n lines. Consider all the points of 
the plane and take the lines of m parallel classes. Then it is clear 
that these form a pg(n,m,m — 1), ie. a net. 


Example 21.7. Consider a Latin square of order n. Let the n? 
cells be the vertices of a graph; two vertices are joined by an edge 
if and only if they are in the same row or in the same column, 
or if they have the same entry. The graph is regular, with degree 
3(n — 1). If two vertices are joined, we may assume without loss 
of generality that they are in the same row (think of the Latin 
square as an OA(n,3)). Then they have n — 2 mutual neighbors. 
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If two vertices are not joined, then they clearly have six common 
neighbors. So this is an srg(n?,3(n — 1),n — 2,6). This is called a 
Latin square graph and the notation is L3(n), in accordance with 
the notation of Example 21.2. Generalizations will occur in a later 
chapter. The graph L3(n) is geometric, corresponding to a partial 
geometry pg(n, 3,2), i.e. a net. Of course lines correspond to rows, 
columns, and symbols of the square. We shall see in Chapter 22 
that in fact nets and orthogonal arrays are equivalent concepts. 


Example 21.8. A quadrangle satisfies the conditions for a partial 
geometry. It is a pg(2, 2,1) which accounts for the name generalized 
quadrangle. 

Consider a hyperoval O in PG2(4). We take as point set the 15 
points of the plane that are not on O. Lines will be the secants of 
O. Then every line has 3 points, and every point is on 3 lines. In 
fact this is a pg(3, 3, 1), ie. GQ(2, 2). 

Note that the same construction applied to PG2(q) for q even 
yields a pg(q— 1,3(¢ + 2), 3(q — 2)). 


Example 21.9. Consider PG»(q) where qg is even. As in Exam- 
ple 19.7, this is an incidence structure whose objects are the 1- 
dimensional subspaces and 2-dimensional subspaces of Fe. Again, 
let O be a hyperoval in the projective plane. Let all 1-dimensional 
subspaces that are in O and their cosets in 3 be lines; points are 
the points of the vector space. Every line has q points; every point 
is on g+2 lines. The fact that O is a hyperoval implies that for any 
point p not on line L, there is a unique line through p that meets 
L. So we have defined a GQ(q—1,q+ 1). 


Example 21.10. Consider the subgroup G of Z§ generated by 
the element (1,1,1,1,1,1). For each coset a+ G, the sum of the 
coordinates of the points is a constant 7. We say that the coset is 
of type i. Let A; be the set of cosets of G of type i. We define a 
tripartite graph [ by joining the coset a+G to the coset a+b+G for 
each b that has only one nonzero coordinate. Clearly, any element 
of A; has six neighbors in A;,; and six in Aji9. We construct 
a partial geometry by taking some A; as point set and one of the 
other two classes A; as line set. Incidence corresponds to adjacency. 
That K = R = 6 is clear. It is an easy exercise to show that T = 2. 
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We now give a nice application of Theorem 21.5. We shall prove 
the theorem, mentioned in Chapter 19 without proof, which states 
that a quasiresidual design with A = 2 is a residual design. Since 
the cases k < 6 require separate treatment, we restrict ourselves to 
blocksize > 6. 


Theorem 21.8. Let D be a 2-(v,k,2) design with k > 6 and 
v = $k(k +1) (so D is quasiresidual). Then D is a residual of a 
symmetric 2-design. 


PROOF: Let B be a block and let a; denote the number of blocks 
(~ B) that meet B ini points. Just as in Theorem 19.9 we find 


Ya = 5 k(k +3), So ia; = k( k+1), S i(i— aj = K(k 1). 


This implies that 5>(¢ —1)(i—2)a; = 0. Therefore any two distinct 
blocks meet in 1 or in 2 points, i.e. the design D is quasisymmetric. 
From Theorem 21.2 we find that the block graph G of D is strongly 
regular. Calculation of the parameters shows that G has the same 
parameters as T(k + 2). So by Theorem 21.5, G is isomorphic to 
T(k +2). This means that we can label the blocks of D with 2- 
subsets of S := {1,2,...,4+2} in such a way that two blocks meet 
in 7 points whenever their labels meet in 2—i points (i = 1,2). We 
adjoin the set S to the point set of D, and we adjoin to each block 
its label, a 2-subset of S. Finally, we consider S as a new block. 
This produces the required symmetric design with 1 = 2 that has 
D as residual with respect to the block S. 


We list below a table of feasible parameter sets with v < 30 and 
what we have learned about them in this chapter. The only one 
left as a challenge for the reader is no. 14, i.e. GQ(2,4) for which 
no construction was given. The corresponding graph is known as 
the Schlaeflt graph. 
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No.| v)}k| A} w | Example 
1/5/2]/0]1/ PG) 

2. Go| side 2.| 303) 

3 |10| 3] 0 |1 | Petersen, 7(5) 

4 [13/6/21] 3 | P(3) 

Be S86 ae Be GO?) 

6 |16;)5] 0] 2 Clebsch 

TVG 62125) 22) Be (A) 

8) EF) Bel], 3°) 4) POT) 

9°. )21)10')3° 6.) £7) 

10 }21/10} 4 | 5 does not exist, conference 
11 /25/8/3]2/) £26) 

12/25/1215 | 6 | La(5) 

13 |26/10| 3 | 4 | STS(13), Theorem 21.2 
14 fa7/iol 1/5] GQe2,4 

15 |}28;}9| 0} 4 does not exist, Thms. 21.3, 21.4 
16 [28/121 6 | 4] T(8) 

17 [29/14| 6 | 7 | P(29) 


We give some remarks on directed strongly regular graphs and a 
few problems. 

The condition on paths of length 2 in the definition of strongly 
regular graphs is best seen by writing (21.5) as follows 


(21.5’) AJ=kJ, AM=kI+\A+pu(J—I-A). 


The definition was generalized to directed graphs by A. Duval (1988). 
We consider a directed graph G without loops or multiple edges. 
If there is an edge from a to b, we write a — b. We allow edges 
in both directions, indicated by a «+ b. In that case we call (a, b) 
an undirected edge. The generalization of (21.5’) will be the same 
with respect to paths of length 2, but we now require that each 
vertex has indegree k, outdegree & and that among these edges 
there are t undirected edges (producing paths of length 2 from a 
vertex to itself). So, G is called a directed stronly regular graph if 
its adjacency matrix A satisfies 


(21.8) AJ=JA=kJ, A? =tl+rA+p(J—I-A). 
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For such a graph on v vertices, the notation dsrg(v;k, t; A, 4) will 
be used. 


Example 21.11. Let G be a graph on six vertices consisting of a 
directed triangle 1 — 2 — 3 — 1, a directed triangle 1’ — 3’ — 
2’ — 1’, and undirected edges (i, 7’), 1 = 1,2,3. If C is the 3 x 3 
circulant matrix with c;; = 1 for (7,7) = (1,2), (2,3), (8,1) and 0 
otherwise, then the adjacency matrix A of G is 


Ome | 
A= & ) | 
One easily checks that A satisfies (21.8) with k = 2,t=1, A =0, 
and 4 = 1. So G is a dsrg(6; 2, 1; 0,1). 


Problem 21J. State and prove an analog of Theorem 21.1 for 
directed strongly regular graphs. 


Problem 21K. Exclude strongly regular graphs and graphs with 
adjacency matrix J — A. Show that the eigenvalues calculated in 
Problem 21J lead to an analog of the “half-case” if (4 — A)? +4(t— 
1) is not the square of an integer. Prove that in this case, the 
adjacency matrix A of the dsrg is of the type of Q in (18.5). 


Problem 21L. Consider the case yw = 1, 1 =0,t =k —1. Prove 
that only three values of v are possible. Show that in the second 
case, an example can be obtained from K33 by replacing vertices 
by directed triangles, and edges by copies of the graph of Example 
21.11 in a suitable way. 


We now give some comments and problems on neighborhood reg- 
ular graphs. 

In a strongly regular graph G the neighborhoods ['(x) and A(x) 
of x in G, respectively in the complement G, are regular graphs. C. 
D. Godsil and B. D. McKay (1979) called any graph that has this 
property a neighborhood regular graph. It is not difficult to show 
that a neighborhood regular graph that is also regular is in fact a 
strongly regular graph. If G or G is not connected, trivial situations 
arise. In the following problems, we assume that G is neighborhood 
regular, not regular, and that G and G are connected. 
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Problem 21M. 

(i) Show that the degree of I(x) does not depend on x. Call this 
number a. 

(ii) Show that there is a number @ such that for each x, the degree 
of A(z) is G. 


Problem 21N. If X and Y are subsets of the vertex set of G, we 
denote by |X Y| the number of edges (x,y) in G with x € X and 
y€Y. Let x; and x2 be two non-adjacent edges in G with degree 
ky, respectively ko, ky # ko. Let d; := deg A(z;), i = 1,2. Note 
that d; 4 dy. Now consider the following subsets of the vertex set 
of G:A:= T(x) N A(x), B= T'(22) FIN zy) C:= T(x) NT (x2), 
and D := A(a1)N A(ae. 

(i) Find relations between the numbers |X Y]| and |X|, where X 
and Y are among B,C’, D. 

(ii) Deduce that 


|DD| = (a+@+4+ 1)d; — d} — a|C| + |CC|. 


(iii) Prove that di) +d) =a+@+1. 
(iv) Show that all the vertices in G have degree k, or kg. 


Example 21.12. Let G be the octagon on vertices 1 to 8 with the 
two diagonals (15) and (26). This is a neighborhood regular graph 
with ky 2: ky 3, a 0, a 2. dy 1, and dy =.2; 


Problem 210. Given a Steiner triple system S on n points, define 
a graph G whose vertices are the triples A of the system and where 
A,B € A are adjacent when |AN B| = 1. 

(i) Show by elementary means (i.e. do not use Theorem 21.2 
or matrix methods) that G is strongly regular and calculate its 
parameters. 

(ii) Show that if C is a clique in G and |C| > 7, then C is a 
subset of the triples that contain a fixed point x for some point x 
of S. 

(iii) Use this to show that if the graphs G, and G» arising from 
two Steiner triple systems on n > 15 points are isomorphic, then 
the two systems are also isomorphic. (We remark that there are 
more than 163929929318400 nonisomorphic Steiner triple systems 
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on 25 points—see R. M. Wilson (1973/74)—so we have at least 
that many nonisomorphic strongly regular graphs on 100 vertices.) 


Problem 21P. Show that equality in Problem 4H cannot occur 
unless n = 3. 


Problem 21Q. Prove the so-called Friendship Theorem: At a 
party with n people (n > 3) every two persons have exactly one 
mutual friend. Then there is a unique person at the party who is 
a friend of all the others. Use problem 1J. 


Notes. 

Strongly regular graphs and partial geometries were introduced 
by R. C. Bose in 1963. A much more general concept, namely 
that of an association scheme, had been introduced by Bose and 
Shimamoto in 1952; see Chapter 30 here and Cameron and Van 
Lint (1991), Ch. 17. 

For a survey of construction methods for strongly regular graphs, 
see Hubaut (1975) and Brouwer and Van Lint (1982). 

The Clebsch graph may be defined by the 16 lines in the Clebsch 
quartic surface, a pair of lines being adjacent if and only if they 
are skew. Alfred Clebsch (1833-1872) was a mathematician and 
physicist who worked in Karlsruhe, Giessen, and Gottingen. He was 
one of the founders of the famous journal Mathematische Annalen. 

The algebra 2 was introduced by R. C. Bose and D. M. Mesner 
in 1959. 

The reference Goethals and Seidel (1970) contains, besides The- 
orem 21.2, many interesting connections between strongly regu- 
lar graphs and other combinatorial designs. Hadamard matrices, 
Steiner systems and the Golay code all contribute to the theory. 
For these connections, again see Cameron and Van Lint (1991). 

The Higman-Sims graph is connected to their famous finite sim- 
ple group (see Higman and Sims (1968)). 

Theorem 21.3 became known as the Krein condition because a 
special case of this theorem was proved by L. L. Scott by applying 
a result of M. G. Krein concerning topological groups to a problem 
on finite groups. The simple proof given in this chapter is due to D. 
G. Higman (1975) and P. Delsarte (1973). The case of equality in 
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the Krein conditions was treated by Delsarte, Goethals and Seidel 
(1977) and Cameron, Goethals and Seidel (1978). 

The idea of the proof of Theorem 21.4 is due to T. H. Koorn- 
winder (1976). 

Theorem 21.5 was proved several times: see for example, L. C. 
Chang (1959), W. S. Connor (1958), A. J. Hoffman (1960). The 
three exceptional graphs for n = 8 are known as the Chang graphs. 

For surveys on partial geometries, see Van Lint (1983), De Clerck 
and Van Maldeghem (1995), and De Clerck (2000). 

Nets were introduced by R. H. Bruck in 1951. Bose’s result on 
pseudo-geometric graphs was inspired by Bruck’s work (e.g. the 
idea of grand cliques). 

For all that one would like to know about generalized quadrangles 
(and more) we refer to the book Finite Generalized Quadrangles by 
S. E. Payne and J. A. Thas (1984). 

Only three proper partial geometries with T’ = 2 are known. The 
geometry pg(6,6,2) of Example 21.9 was first constructed by Van 
Lint and Schrijver (1981); the construction that we described is due 
to Cameron and Van Lint (1982). The other known examples are 
a pg(5, 18,2) due to Haemers and a pg(8, 20,2) found by Mathon 
(with the aid of a computer). 

A big step in the direction of the proof of the nonexistence of 
a projective plane of order 10, mentioned in Chapter 19, was the 
proof of the nonexistence of pg(6,9,4). This implied (see Example 
21.8) that, if the plane existed, it would not have any hyperovals. 
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22 


Orthogonal Latin squares 


Two Latin squares L}: Rx C— S and Ly: Rx C —T (with the 
same row and column sets) are said to be orthogonal when for each 
ordered pair (s,t) € S x T, there is a unique cell (x,y) € R x C' so 
that 

Ii(z,y)=s and Io(x,y)=t. 


The use of the word ‘orthogonal’ is perhaps unfortunate since it has 
other meanings in mathematics, but it has become far too common- 
place to try to change now. 

For example, let 


A kK Q J & 0° 6% & 
ee old pul* oo 4 
Se Oe FE A? “ |O @ & © 

kK A J Q > & & YO 

Figure 22.1 


where R = C = {1,2,3,4}, S = {4,4,Q,J}, T = {@,9,¢, &}. 
The orthogonality of A and B is evident when the two squares are 
superposed and we see that each element of S' x T appears exactly 


once. 
A@ KO QO Jh 
Qh& J> AY KA 
JO Q@ Ke A 
KO Ad& Je QY 
Orthogonality does not depend on the particular symbol set used. 
We could replace the playing card suits in our example by any 
four symbols in any order, and the resulting square would still be 
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orthogonal to A. We have used Latin letters for the symbols of A; 
if we had used Greek letters for B, it would have been clear why 
the superposition of two orthogonal Latin squares is often called a 
Greco-Latin square. 

The example we have given with playing cards is similar to what 
is known as Euler’s 86 officers problem. According to folklore, 
Euler was asked by Catherine the Great (at whose court he was in 
residence) to arrange 36 officers from six different regiments and 
of six different ranks (one officer of each rank from each regiment) 
in a 6 by 6 array so that each row and each column contained one 
officer of each rank and one officer of each regiment. A solution 
requires a pair of orthogonal Latin squares of order 6. 

Surprisingly, there is no solution. Whether he was asked by 
Catherine or not, Euler did consider the problem in 1779 and con- 
vinced himself that it was impossible. Euler was capable of monu- 
mental calculation, but it is not clear that he had really examined 
all cases. This was systematically done in 1900 by G. Tarry. Today 
a computer can do this easily. A short proof (still involving several 
cases to be analyzed) was given by D.R. Stinson (1984). 

Euler knew that a pair of orthogonal Latin squares of order n 
existed for all odd values of n, and all n = 0 (mod 4); see Theorem 
22.3 below. For examples of orthogonal Latin squares of odd orders 
n, let G be any group of order n and define squares L; and Ly with 
row, column, and symbol sets G by 


ii(aa) Sty, - “Ineo: 


For completely trivial reasons, there is no pair of orthogonal 
Latin squares of order 2. Euler asserted that he was of the opin- 
ion that this impossibility for n = 2,6 extended as well to n = 
10,14,18,..., ie. to orders n = 2 (mod 4). This statement was 
known as ‘Euler’s conjecture’ for 177 years until it was suddenly 
and completely disproved by Bose, Parker, and Shrikhande. We will 
prove later their theorem that pairs of orthogonal Latin squares of 
orders n exist for all n except n = 2,6. 

At various times, a more general question has been raised: can 
one find sets of many Latin squares, any two of which are orthogo- 
nal? We will denote by N(n) the largest integer k for which there 
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exist k Latin squares of order n that are pairwise orthogonal. For 
example, we may add 


a 
0 
a 


RDWRO 
2S ORD 
Ok BR 


p 


(the addition table of the Klein four-group) to the squares in Fig. 
22.1 to obtain three pairwise orthogonal Latin squares of order 4; 
thus N(4) > 3. We remark that the definition of orthogonality is 
trivial or vacuous for n = 1 or 0, and so N(1) = N(0) = oo. 

Here is a construction that proves N(q) > q—1 for a prime power 
q. All row, column, and symbol sets are to be the elements of the 
field F,. For each nonzero element a in F,, define Lq(x,y) := ax+y; 
these gq — 1 squares are pairwise orthogonal. As we all know, the 
systems 


ax + by =s 
co +dy=t 
have unique solutions (x,y) when ad — bc ¥ 0, so it is easy to tell 


when two squares defined by linear equations are orthogonal. Here 
are the squares for q = 5: 


0 12 3 4 0 12 3 4 
12 3 4 0 23 4 0 1 
23 4 0 1], 4012 3], 
3.4 0 1 2 12 3 4 0 
4012 3 3.4 01 2 
0 12 3 4 0 12 3 4 
3.4 0 1 2 4012 83 
Le ae Be Aw Ory 3.4 0 1 2 
4012 3 23 4 0 1 
23 4 0 1 12 3 4 0 


Theorem 22.1. For n > 2, we have 1 < N(n) <n-1. 


PROOF: We may change all row, column, and symbol sets in a set 
of k pairwise orthogonal Latin squares {L;}*_, to {1,2,...,n} (for 
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notational convenience), and we might as well rename the symbols 
if necessary so that the first row of each square is 1, 2,...,7 in that 
order. Now we consider the elements 


HOD) TOD); 0g BOD) 


in the second row, first column. None of these is 1, since 1 occurs 
already in the first column. They are distinct because if D;(2,1) = 
L;(2,1) = s, say, then the fact that also L;(1,s) = L,(1,s) = s 
would contradict the orthogonality of L; and L;. The upper bound 
follows. 


In view of our construction from finite fields and the simple the- 
orem above, we have 


O21) N(q)=q-1 if qisa prime power. 


At this point, we want to show that the concept of pairwise or- 
thogonal Latin squares is equivalent to something the reader has 
already seen in the previous chapter! The assertion of the theo- 
rem will be more immediately apparent if we first notice that the 
property of being a Latin square can itself be described in terms of 
orthogonality. For example, a square of order 5 is Latin if and only 
if it is orthogonal to the two squares 


and 


LKwWwWnDNer © 
LrwonNnr © 
LrwnNynr © 
rwWwWDNere © 
LrwnDNer © 
oOo oO oO 
See eee 
www Wd Ww 
Wwwww 
EP LP 


Theorem 22.2. A set of k pairwise orthogonal Latin squares of 
order n exists if and only if an (n,k + 2)-net exists. 


PROOF: Suppose L;: Rx C — S;,1<i<k, are pairwise orthog- 
onal Latin squares. Let P = R x C, the set of n? cells. Roughly 
speaking, we take as lines the rows, the columns, and the ‘iso- 
symbol lines in each square. More formally, let 


Ai = {{(z,b): bE Ch}: a2 € R}, 
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Az = {{(a,y): ae RE ry € CH, 
Ai+2 = {{(2, y) : Li(z, y) = c} Nee Diy. 1 Sisk, 


and let B = US?,A;. That (P, B) is an (n,k + 2)-net follows from 
the definitions of Latin and orthogonal. 

Given an (n,k+2)-net (P, B), where B = U"*?A; is the partition 
into parallel classes, define L; : A; x Az > Aj+2 by declaring that 
L;(A, B) is to be the unique line in A;+2 containing the point of 
intersection of A and B. As usual we leave to the reader the details 
of checking the Latin and orthogonal properties of these squares. 


Corollary. N(n) = n —1 if and only if there exists a projective 
(or affine) plane of order n. 


The value of N(n) tells us, in some very imprecise sense, how 
close to the existence of a projective plane of order n we may get. 
But we know terribly little about this function. The only value of 
n other than prime powers for which N(n) is known is n = 6. 


Problem 22A. (i) Consider the complement of the graph of an 
(n,n)-net and prove that N(n) > n—2 implies that N(n) =n-1. 

(ii) Use the result of Problem 21F to prove that ifn > 4, then 
N(n) > n—3 implies that N(n) =n —-1. 


Theorem 22.3. 

(i) N(nm) > min{N(n), N(m)}. 

(ii) If n = p\'ps?---pt is the factorization of n into powers of 
primes, then 

N(n) 2 min (p{* — 1). 

PRoorF: Part (ii) follows from part (i) and (22.1), used inductively. 

To prove part (i), we first define a Kronecker-like product for 
squares in the natural way: Given DL; : R; x C; — Sj, 1 = 1,2, 
define 

Ly ® Lo : (R, x Ro) x (C1 x C2) =z (Si) x S9) 


by 


(Li ® Lo) ((x1, £2), (Yi, yo)) == (Li(1, y1), Lo(x2, y2)). 
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It is straightforward to verify that if {A;}*_, are pairwise orthogonal 
Latin squares of order n and {B;}*_, are pairwise orthogonal Latin 
squares of order m, then {A;@B;}*_, are pairwise orthogonal Latin 
squares of order nm. We leave this task to the reader, but advise 
you just to read on and not bother. 


Theorem 22.3 is known as MacNeish’s theorem. MacNeish con- 
jectured in 1922 that equality held in Theorem 22.3(ii). This would 
imply Euler’s conjecture. But combinatorial problems of this type 
rarely have such simple answers. 

The MacNeish conjecture was disproved first for n = 21 when 
three Latin squares of order 21 were constructed with the aid of 
the 21 point projective plane of order 4; see Example 22.1 below. 
This was an example of composition methods, which also provided 
the first counterexample to Euler’s conjecture (n = 22). The first 
pair of orthogonal Latin squares of order 10 was found by Parker by 
what might be called difference methods. We discuss composition 
methods first; they have proved more powerful for large values of 
n. 

Bose, Parker and Shrikhande gave many constructions for sets 
of orthogonal Latin squares. We choose to describe two basic and 
elegant ones in terms of quasigroups. Recall that a quasigroup is a 
Latin square whose row, column, and symbol sets are the same set 
X. A quasigroup L is idempotent when L(x,x) = x for all x € X. 
For example, if X is the finite field F, of order q, then 


LE,(z,y) = ax+(1—a)y 


defines an idempotent quasigroup L, if a 4 0,1. Any two squares 
of this form are orthogonal. 


Problem 22B. For a prime power q > 4, construct two Latin 
squares A,S of order q with row and column sets equal to F, so 
that A is orthogonal to its transpose, and to S, which is to be 
symmetric. In addition, ensure that S is idempotent if q is odd 
and ‘unipotent’, i.e. has constant diagonal, if q is even. 


Suppose we have a linear space, cf. Chapter 19, on a point set X 
with line set A. Further suppose that for each line A in A, we have k 
pairwise orthogonal idempotent quasigroups L+}, L4,..., IA on the 
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set A. Then we can construct k pairwise orthogonal idempotent 
quasigroups Ly, Le,...,L, on the entire set X by declaring, for 
each i = 1,2,...,k, that D,(2,x) := x for x € X, and for distinct 
x,y € X, Li(x,y) := LA(x,y), where A is the unique line in A 
which contains both x and y. It is a simple matter to check that 
I, I9,..., L, are Latin squares and that they are orthogonal; for 
example, given i,j,s and t, s #£ t, acell (x,y) for which 


Lilay) ='s and. Lj(ajy) =t 


exists, and x,y may be found in that line B which contains s and 
t, because of the orthogonality of L? and L? z 


Theorem 22.4. For every linear space with line set A on an n-set 
X, we have 
N > min{N(|A]) — 1}. 
(n) 2 min{N(|A]) — 1} 


PRooF: Let k be the indicated minimum. This means we have at 
least k + 1 pairwise orthogonal quasigroups on each A € A. We 
will describe how to obtain k pairwise orthogonal idempotent quasi- 
groups on each A. Then N(n) > k follows from the construction 
above. 

In general, let Hy), H2,...,H,4, be pairwise orthogonal quasi- 
groups on an m-set B. Pick any b € B. There will be m cells 
(x,y) for which H;,41(x,y) = 6, one in each row and one in each 
column. Simultaneously permute the columns, say, of all squares 
so that these cells are on the diagonal of the (& + 1)-th square. 
Orthogonality implies that for each i < k, all m symbols occur on 
the diagonal of the 7-th square. Finally, we permute the symbols 
independently in each of the first k squares so that the resulting 
squares are idempotent. 


Theorem 22.5. If A is the set of lines of a linear space on an n-set 
X and B C Ais a set of pairwise disjoint lines, then 


N(n) > min ({N(|A]) —1: 4 € A\B} U {N(\B]) : B € BY). 


PROOF: Let k be the indicated minimum. As we saw in the proof 
of the preceding theorem, there exist k pairwise orthogonal idem- 
potent quasigroups L/ on each A in A\B. If necessary, we add 
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singleton sets to B so that B becomes a partition of X. We have 
k pairwise orthogonal quasigroups L? (not necessarily idempotent ) 
on each B € B. We define Lj, Lo,...,L, on the entire set X by 
declaring, for each i = 1,2,...,k, that L;(2,2) := L?(x,2) where 
B is the unique member of 6 that contains x, and for distinct 2, y, 
L,(x,y) := LA(x,y), where A is the unique line in A (whether or 
not in B) that contains both x and y. The quasigroups L; are or- 
thogonal (the easy details are left to the reader) and thus N(n) > k. 


Example 22.1. Let n = 21 and consider the projective plane of 
order 4 with 21 lines of size 5. Theorem 22.4 implies that N(21) > 3 
and this disproves MacNeish’s conjecture. Delete three noncollinear 
points. We have a linear space with 18 points, and lines of sizes 5, 
4, and 3; the three lines of size 3 are pairwise disjoint. Theorem 
22.5 implies that N(18) > 2 and this disproves Euler’s conjecture. 


Example 22.2. To see how far wrong Euler was, we consider two 
more values of n = 2 (mod 4). Delete three noncollinear points 
from a projective plane of order 8 to obtain a linear space on 70 
points with lines of sizes 9, 8, and (pairwise disjoint lines of size) 7. 
Theorem 22.5 shows N(70) > 6. It is possible to find seven points 
in the plane constructed from Fg such that no three are collinear 
(even 10, see Problem 19I), and deleting these produces a linear 
space which can be used in Theorem 22.4 to give us N(66) > 5. 


The next construction would be difficult to describe without the 
use of transversal designs. Transversal designs provide a compact 
and conceptually convenient language with which to manipulate 
sets of pairwise orthogonal Latin squares. (They also provide a 
large family of linear spaces useful for the construction of other 
combinatorial designs or even more orthogonal Latin squares.) A 
TD(n, k) can be defined as the dual incidence structure of an (n, k)- 
net, i.e. as an (n,k,k —1)-partial geometry. So there are nk points 
and n? blocks (we will use ‘blocks’ rather than ‘lines’ when dis- 
cussing transversal designs). Each point is in n blocks; each block 
contains k points. The points fall into k equivalence classes (called, 
confusingly, groups) of size n so that two points in the same group 
are not contained in a block while two points in different groups 
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belong to exactly one block. In particular, each block contains ex- 
actly one point from each group. We need to refer to the groups 
so often that we incorporate them into the notation and speak of 
a transversal design (X,G,.A) where the three coordinates are the 
sets of points, groups, and blocks, respectively. (X,G UA) is a 
linear space with block-sizes k and n. 


Example 22.3. Here is a TD(2,3): 
Groups: {a4, a2}, {b1, bo}, {ce1, co} 
Blocks: {a1, by, er}; {a1, by, co}, {a2, by, co}, {a2, by, cy}. 


In view of Theorem 22.2, the existence of a TD(n, k+2) is equiva- 
lent to the existence of k pairwise orthogonal Latin squares. At the 
risk of boring some readers, we review the connection by quickly 
describing how to get the squares from a TD(n,k + 2): number 
the groups {Gj, Go,...,Gr+2} and define L; : Gi x Gz — Gii2 by 
declaring L;(x,y) to be the point of intersection of G;,. and the 
block which contains x and y. 

A TD(n,k) is said to be resolvable when the set A of blocks 
can be partitioned into n parallel classes A), Ao2,...,An, i-e. each 
A; is a set of n blocks of size k which partition the point set X. 
From a TD(n,k) (X,G,A), we can always construct a resolvable 
TD(n,k — 1) by deleting the n points of one of the groups Gop = 
{@1,1,---,%n}, which removes exactly one point from each block; 
for each i, the blocks which have had x; removed form a parallel 
class on the point set X\Go of the TD(n,k — 1). (Conversely, a 
resolvable TD(n, k — 1) can be extended to a TD(n, k).) 


Theorem 22.6. If0 <u <t, then 
N(mt + u) > min{N(m), N(m+ 1), N(t) — 1, N(u)}. 


PRooF: Let k be the right-hand side above, plus 2. This means 
that transversal designs TD(m,k), TD(m+1,k), TD(t,k+1), and 
TD(u,k) exist; and to prove the theorem, we must construct a 
TD(mt + u,k). 

As the construction is rather technical, we warm up by first de- 
scribing the construction of a TD(mt, k) from a TD(t, k) and vari- 
ous TD(m,k)’s. This is the degenerate case u = 0 in the theorem. 
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The construction does not require a TD(m+1,k) or a TD(t,k +1) 
and thus it reproves Theorem 22.3(i). 

Let (X,G,A) bea TD(t,k). To each x € X, associate a set M, of 
m new elements so that any two sets M, are disjoint. For S C X, 
let Mg := Uses My. 

We construct a TD(mt,k) on the point set Mx of size kmt with 
groups {Mg : G € G} each of size mt; the blocks B are obtained as 
follows. For each A € A, choose blocks By, so that 


(M4, {M, :a € A}, Bag) 


is a TD(m,k), and let B = Uye4By. The verification is straight- 
forward. 

To return to the general case, recall that the existence of a 
TD(t,k +1) implies the existence of a resolvable TD(t,k). So we 
have a TD(t,k) (X,G,A) where A admits a partition into parallel 
classes A,, Ao,...,A;. We treat the blocks of the first u parallel 
classes in one way, and the blocks in B := Uj_,,,;.A; in another way. 

Let (U,H,C) be a TD(u,k). We also require a partition of U 
into u k-subsets {Ky, Ko,...,K,}; each is to consist of exactly one 
point from each set H € H, but these k-subsets are not required to 
be blocks in C. 

Let G = {Gi, Go,...,G,} and H = {M), Ho,..., H,} be number- 
ings of the groups. We construct a TD(mt + u,k) with point set 
Y := Mx UU and groups 


J :={Me,U Ai, Me, U Ao,..., Me, U He}. 


The blocks are obtained as follows. For each block B € B as before, 
let 
(Mz, {M, 1LE Bh, Dz) 


be a TD(m,k). For each block A € CalAj, let 
(Ma U Kj, (Man Mg,) U (Ki H;) ae a LQ ieasg hy, 2A) 


be a TD(m+1,k) in which K; occurs as a block, and let D’, denote 
the remaining (m+ 1)? — 1 blocks. Then we claim that (Y, 7, €) is 
the required TD(mt + u,k), where 


E:=cu( (J Ds)u( U Di). 


BeB AEAU UAy 
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Verification requires consideration of several cases. 


Example 22.4. With m = 3 in Theorem 22.6, we see that 


reer +u) >2 whenever 0<u<t,N(t) > 3, and N(u) > 2. 
We may take (t,u) = (5,3),(7,1), (7,5), and (9,3) to find that 
N(n) > 2 for n = 18, 22,26, and 30. 

Theorem 22.7. N(n) > 2 for alln F 2,6. 

PROOF: We need only consider n = 2 (mod 4). For n = 10,14, 


see Examples 22.6 and 22.7 below. For n = 18, 22,26, and 30, see 
Example 22.4 above. We now assume n > 34. One of 


n—-1,n-—3,n-—5,n-7,n-9,n-11 
is divisible by 3 but not by 9, so we can write n = 3t + u where 
u = 1,3,5,7,9, or 11, and ¢ is not divisible by 3. Since n is even, 
t is also odd, so N(t) > 4 by Theorem 22.3(ii). A consequence of 
n> 34ist>11,so0<u<t, and then N(n) > 2 by (22.2). 


Theorem 22.8. N(n) — o asn— o. 


Proor: Let x be a positive integer. We claim that N(n) > x —1 
whenever 


(22.3) n>2 (I>) 


psx 
where the product is extended over all primes p < zx. 

Given n satisfying (22.3), let m be chosen (Chinese remainder 
theorem) so that 0 < m < ([],<,p)* and such that for all primes 
pS, 

_ { —1 (mod p*) _ if p divides n, 
—— { 0 (mod p*) if p does not divide n. 
Then choose an integer t = 1 (mod [J,,.,,p) so that 


z+1 
0<ucn—mt< ([]>} ; 


pSa 


The remainder of the proof is provided by Problem 22C below. 
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Problem 22C. With m, t, and u chosen as above, show that u < t 
and that Theorem 22.3(ii) implies that N(mt+u) >a-—- 1. 


We have seen constructions by so-called difference methods in 
Chapter 19. We use this idea to construct Latin squares here. It 
is convenient to describe our constructions in terms of orthogonal 
arrays. An OA(n,k) isak x n? array (or we prefer here to think of 
a set of n? column vectors of height k since the order of the columns 
is immaterial) whose entries are taken from a set S of n symbols, so 
that for any distinct 7,7, 1 < 7,7 < k, and any two symbols s,t from 
S, there is a unique column whose 7-th coordinate is s and whose 
j-th coordinate is t. This generalizes the OA(n,3)’s introduced in 
Chapter 17. 

This is yet another equivalent formulation of orthogonality; the 
existence of an OA(n,k + 2) is equivalent to the existence of k 
pairwise orthogonal Latin squares. For example, to get the array 
from squares L 1, L2,..., L,, assume without loss of generality that 
all row, column, and symbol sets are the same set S and take all 
columns 


li, 9, Li(é, 7), Lol4,9),---, Delt, a)", (en ae 
Readers should immediately be able to write out two orthogonal 


squares from the OA(3, 4) in Fig. 22.2 (of course, any two rows may 
be used to ‘coordinatize’ the squares). 


2 Ee Oe OE ee 

Lh 2 ae a ee Oe 

Ba ES Be Ea a BE 

i Sa aa tame | Ga Sa | 
Figure 22.2 


Example 22.5. The following matrix of elements of Z,; was found 
with the aid of a computer by Schellenberg, Van Rees, and Vanstone 
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(1978): 

000000000000000 

0123 4567 8 9 1011121314 

02579124 114113 6 8 1013 

06314107134112851129 

01061112 7123 8134149 5 
It has the property that for any two rows, the differences between 
the coordinates of those two rows comprise all elements of Z,;, each 
with multiplicity one. It follows that we obtain an OA(15,5) by 
taking all translates of these columns by elements of Z;;. Thus three 
pairwise orthogonal Latin squares of order 15 can be constructed. 
But the OA(15,5) is resolvable: the 15 translates of any column 
have the property that in each coordinate, all symbols occur once. 
As with transversal designs, we can add a new row to obtain an 
OA(15, 6) and hence four pairwise orthogonal Latin squares of order 
15. It is not known whether N(15) > 5. 


The last three rows of the above matrix are what we call below 
pairwise orthogonal orthomorphisms of Z5. 

In 1960, Johnson, Dulmage, Mendelsohn and a computer proved 
by this method that N(12) > 5, using the group Zz @ Z2 @ Z3. The 
cyclic group of order 12 is of no use; see Theorem 22.9 below. 


Example 22.6. We will describe a construction that proves 
N(m) > 2 => N(8m+1) > 2. 


Let G be an abelian group of order 2m+1 and let M be a system of 
representatives for the pairs {i,—7} (¢ 40) in G. That is |M| =m 
and G = {0} UMU —M. For each i € M, introduce a new symbol 
oo; and take S := GU {oo; : i € M} as the symbols set for an 
OA(3m + 1,4). 

Consider the set of all column vectors obtained from 


0 0:1 £0 ' fj 
0 lhe eaeel| es Mo 

ay dasa Oe EN Mel eo Pica 
0 = : Oe, (6s: 


by translating them by the elements of G with the understand- 
ing that the oo’s are fixed, ie. 00; +g = co;. This gives us 
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(4m +1)(2m +1) columns. Add to these the m? columns of an 
OA(m, 4) to obtain an OA(3m + 1,4). 

To be more explicit in the case m = 3, let us use symbols Z7 U 
{z,y,z} and M = {1,2,3}. We take the seven translates of the 
following 13 columns, and adjoin the OA(3, 4) from Fig. 22.2: 


OF a DD) 6 sy: OD ee oO bd 
Oo 7Orosae We SD Gs. age a 2 Oe ee ee 
016202 5 y 0 3 4 z O 
061025 20 y 43 0 2 


Verification of the construction requires special consideration of 
the symbols x, y, z (which we have used rather than the co’s), and 
the observation that the differences between any two of the above 
rows (when both coordinates are in Z7) comprise all the elements 
of Z7, each with multiplicity one. The resulting squares are shown 
below. 


Ozly2a23654 O4r5y62123 
41z22y32x2065 zl5x6y0234 
t5223y4106 Llz26x20y345 
5263824y210 y2z2302721456 
y6x20425321 2y3241272560 
6y0x2152432 a3y42z252601 
z0Oylr26543 3z24y526012 
1234560ayz 60123 452xry2 
2345601lza2y S601234y22 
3456012 y22 4560123 2za2y 


Example 22.7. We will describe an OA(14,4) on the symbols 
Zi U {x,y,z}. We take the 11 translates by elements of Z1; of the 
following 17 columns, and adjoin the OA(3, 4) from Fig. 22.2: 


Or. OO ae Te) ae a ee Oe 2, Be Ore B90 
0.10) 6.4.0. oe ol AO ee 26 Oe. BY 
O 4° 1°06 4. 0-2 1.600 -y 2 Oo Oo zB 
0-6 4 1 0 1 40-2 2 6 0 y 8 9 O 2 


Once again, verification of the construction requires special consid- 
eration of the symbols x,y, z (which we have used rather than the 
oo’s), and the observation that the differences between any two of 
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the above rows (when both coordinates are in Z;,) comprise all the 
elements of Z,, each with multiplicity one. 


Problem 22D. Show that the existence of k pairwise orthogonal 
idempotent quasigroups is equivalent to the existence of a TD(n, k+ 
2) in which there may be found a parallel class of blocks. 


An orthomorphism of an abelian group G is a permutation o of 
the elements of G such that 


rreo(r)—-2 


is also a permutation of G. The reader can check that the square 
L(x, y) := a(x) +y is Latin if and only if o is a permutation, and is 
orthogonal to the addition table A(z, y) := «+ y of G if and only 
if o is an orthomorphism. 

We remark that if there is any orthogonal mate to A, then there 
is an orthomorphism: the cells in the positions where an orthogonal 
mate contains a given symbol have the property that there is ex- 
actly one in each row and column, and so are of the form (z,7(2)), 
x € G, for some permutation 7. Since these cells must contain 
different symbols in A, the mapping o defined by o(ax) := 4+ T(z) 
is a permutation. In view of this remark, the following theorem 
proves that cyclic squares of even order have no orthogonal mates, 
i.e. there are no Latin squares orthogonal to it. 


Theorem 22.9. If an abelian group G admits an orthomorphism, 
then its order is odd or its Sylow 2-subgroup is not cyclic. 


PROOF: Ifthe Sylow 2-subgroup is cyclic and nontrivial, then there 
is exactly one element z of G with order 2. When we add all 
elements of G, each element pairs with its additive inverse except 
z and 0; so the sum of all elements of G is z. But if o were an 
orthomorphism, then 


a contradiction to our choice of z. 
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Remarks. If G is abelian and has odd order, then it has automor- 
phisms fixing only the identity (this is true for some abelian groups 
of even order also), and these are orthomorphisms. Even if G is 
not abelian, one can define orthomorphisms (also called complete 
mappings) and it remains true that these do not exist if the Sylow 2- 
subgroup is cyclic and nontrivial—see Hall and Paige (1955), where 
complete mappings are also shown to exist for solvable groups with 
trivial or noncyclic Sylow 2-subgroups. 


H. B. Mann (1950) observed that a Latin square of order 4t + 1 
with a subsquare of order 2¢ has no orthogonal mates. Similarly, a 
Latin square of order 4t+ 2 with a subsquare of order 2¢+ 1 has no 
orthogonal mates. Both these results are corollaries of part (ii) of 
the following theorem, which also proves that none of the pairs of 
orthogonal squares of orders 12t + 10 we constructed in Example 
22.6 can be extended to a set of three or more pairwise orthogonal 
Latin squares. We leave it to the reader to convince himself that 
subsquares correspond to subtransversal designs. 


Theorem 22.10. Let (X,G,A) be a TD(n,k) which contains a 
sub-TD(m,k) (Y,H,B) with m <n. (This means Y C X, H = 
{GNY:GeEG}, andBC A.) Then 

(i) m(k—1) <n; 

(ii) if (X,G,A) is resolvable, then 


Proor: Pick a point 79 € X\Y. Say xo belongs to Go € G. For 
each of the m(k — 1) points y € Y\Gp, there is a unique block 
A, € A with {x,y} C A,. A block containing two points of Y 
must belong to B (and could not contain x9), so these m(k — 1) 
blocks are distinct. This number cannot exceed the total number 
of blocks on x9, which is n. 

Suppose that (X,G,.A) is resolvable and that {A;,Ao,...,An} is 
a partition of A into parallel classes. Let s; denote the number of 
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blocks of A; which are contained in Y and let t; denote the number 
of blocks of A; which meet Y in exactly one point. Then 


ks, +t; =|Y|=mk and s;4+t <n, 


from which it follows that (k — 1)s; > mk — n, or equivalently, 


mk—-n 
oT | 


Part (ii) now follows from m? = |B] = S>y_, s;. 


Problem 22E. Prove that N(24) > 3 and N(33) > 3. 


Problem 22F. Suppose we have c— 1 mutually orthogonal Latin 
squares of order k Then we have an OA(k,c+1) which we represent 
as follows: 


da? Ti+ apace. Mtlse Oe 8 2 k k k; 
de 2, ee, OR 
: Ay Ap-1 
ee hte Sie 
where every row of each matrix A; is a permutation of 1,2,...,k. 


Let A be the c x k(k — 1) matrix formed by the matrices A;. Use 
this matrix and the matrix S' of Problem 19U to prove the following 
theorem. 


Theorem. If there exists asymmetric (v, k, 1)-design, then N(v) > 
N(k). 


Problem 22G. Generalize Theorem 22.6 to the following. 
Theorem. [f0 <u<t,0<v <t, then 


Problem 22H. Show that N(21) > 4. 
Problem 22I. Show that N(51) > 4. 
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Problem 22J. Suppose that by giving examples one has shown 
that N(n) > 3 fora < n < 7a+3. Show that N(n) > 3 for 
7a <n < 7a and conclude that N(n) > 3 for n >a. 


Notes. 

The term ‘mutually orthogonal Latin squares’ rather than ‘pair- 
wise orthogonal Latin squares’ and its acronym ‘MOLS’ are in com- 
mon usage. 

We have already mentioned Euler in the notes to Chapter 1. 
Euler’s paper of 1782 was titled ‘A new type of magic square’, 
because Euler had noticed that special magic squares arise from 
a pair of orthogonal Latin squares. As an example, change the 
symbol sets of A and B in Fig. 22.1 to {4,3,2,1} and {12,8, 4, 0}, 
respectively, and add, rather than superpose, to obtain 


16 11 6 #1 
2. <a AD 16 
9 14 3 8 
~ 4 #13 = «10 


Each line of this matrix sums to 34. (The Latin property does not 
in general imply that the diagonals sum to the ‘magic number’, but 
our example is a particularly nice square: the diagonals, the four 
corners, and many other sets of four entries sum to 34.) 

The authors are aware that ‘A’ is used in Fig. 22.1 both as the 
name of a matrix and an entry in that same matrix. We are just 
checking whether readers are on their toes. 

Orthogonal Latin squares, often in the form of orthogonal ar- 
rays, are of great importance in the statistical theory of design of 
experiments. 

MacNeish’s paper of 1922 actually includes a false proof of Euler’s 
conjecture. 

The connection between orthogonal Latin squares and finite pro- 
jective planes was observed by R. C. Bose (1938). 

Many results on orthogonal Latin squares were anticipated by 
E. H. Moore (1896), as has been pointed out to the authors by R. 
D. Baker. Unfortunately, the term Latin square is nowhere to be 
found in Moore’s paper, so no one noticed his results for a long 
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time. The results in his paper include special cases of Theorem 
22.3 and Theorem 22.4 (albeit in entirely different forms) as well as 
descriptions of projective planes of prime power order, so he could 
have disproved the MacNeish conjecture had he known of it. 

Theorem 22.6 is a special case of a construction of Wilson (1974) 
which in turn has been generalized in several ways. See T. Beth, 
D. Jungnickel, and H. Lenz (1986). 

Using methods similar to Problem 22J, the second of the present 
authors proved that N(n) > 3 for n > 47. Presently it is known 
that N(n) > 3 for n > 10. See Colbourn and Dinitz (1996) for 
recent lower bounds on N(n) for various n. 

Theorem 22.8 is due to Chowla, Erdés, and Straus (1960). They 
proved more: N(n) > n'/9! for sufficiently large n. Their work 
is number-theoretical and is based on the Bose—Parker—Shrikhande 
constructions. Their result has been improved several times, but 
the best result to date is that of T. Beth (1983). This seems far 
from the real truth. It is quite possible that N(n) > n/10, say, for 
all, or for all but finitely many, values of n, but at the present time 
this appears to be incredibly hard to prove. 
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23 


Projective and combinatorial 
geometries 


A combinatorial geometry is a pair (X,F) where X is a set of points 
and where F is a family of subsets of X called flats such that 


(1) F is closed under (pairwise) intersection, 

(2) there are no infinite chains in the poset F, 

(3) F contains the empty set, all singletons {x}, « € X, and the 
set X itself, 

(4) for every flat EF € F, E # X, the flats that cover F in F 
partition the remaining points. 


Here, F' covers E in F means that E,F € F, E S F, but that 
E G G c F does not hold for any G € F. This latter property 
should be familiar to the reader from geometry: the lines that 
contain a given point partition the remaining points; the planes 
that contain a given line partition the remaining points. 

A trivial example of a geometry consists of a finite set X and all 
subsets of X as the flats. This is the Boolean algebra on X. 

We remark that (1) and (2) imply that F is closed under arbitrary 
intersection. 


Example 23.1. Every linear space (as introduced in Chapter 19) 
gives us a combinatorial geometry on its point set X when we take 
as flats 0, all singletons {{a}:a € X}, all lines, and X itself. The 
fact that the lines on a given point partition the remaining points 
is another way of saying that two points determine a unique line. 


Example 23.2. Every Steiner system S(t,k,v) gives us a com- 
binatorial geometry on its point set X when we take as flats all 
subsets of cardinality < t, all blocks, and X. To get a geometry by 
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this construction, it is not necessary that all blocks have the same 
size, but only that each t-subset is contained in a unique block 
(cf. Problem 19R, where we have called these Generalized Steiner 
Systems). 


Example 23.3. Let V be an n-dimensional vector space over a 
field F. By an affine subspace of V, we mean either the empty 
set or a coset (or translate) of an (ordinary, i.e. linear) subspace 
of V in the additive group. For example, for a,b € F, the subset 
{(x,y) : y= ax+)} is an affine subspace of F?. The set V together 
with all affine subspaces forms a combinatorial geometry called the 
affine geometry AG,,(F). We write AG,,(q) for AG,,(F,); the case 
n = 2 was introduced in Chapter 19. 


Example 23.4. The projective geometry PG,,(F), while more fun- 
damental than AG,,(F), can be a little more awkward to define. 
Let V be an (n+1)-dimensional vector space V over a field F. The 
point set X of PG,,(F) is to consist of all the 1-dimensional (linear) 
subspaces of V. For example, if F is the field F, of g elements, then 
the number of projective points of PG,,(F, ) is 


iselse =n 


a Hgts +@tq4tl. 


To each linear subspace W of V, we associate a flat Fy consisting 
of all the 1-dimensional subspaces of V that are contained in W, 
and take F to be the set of all such flats Fy. We write PG,,(q) for 
PG, (E,); the case n = 2 was introduced in Chapter 19. 

Projective geometries PG,,(F) can be defined also for F a division 
ring (lacking commutativity of multiplication); see Crawley and 
Dilworth (1973). 


Let Y be any subset of the elements X of a combinatorial geom- 
etry (X,F) and let 


Ea{FPOY ek ert 
Then (Y,€) is also a combinatorial geometry, the subgeometry on 


Y. For example, AG,,(F) is a subgeometry of PG,(F). There is 
a standard embedding of the point set of AG,(F) into the point 
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set of PG,,(F): to each vector x in an n-dimensional vector space 
V, associate the 1-dimensional subspace that is the span of (x, 1) 
in the (n + 1)-dimensional space V x F. The image consists of all 
projective points not contained in the hyperplane V x 0. 

The set of flats F of a combinatorial geometry, when ordered by 
inclusion, has several important properties. 

A lattice L is a partially ordered set with the property that any 
finite subset S C L has a meet (or greatest lower bound), that is an 
element b in L so that 


Vaes|b < a] and Vacs 6G) => 65-0, 


as well as a join (or least upper bound), that is an element 6} in L 
so that 


Vaes|b = a] and Vee ic Sia)! eb. 


The meet and join of a two-element set S = {x,y} are denoted, 
respectively, by x A y and x V y. It is easily seen that A and V are 
commutative, associative, idempotent binary operations; moreover, 
if all two-element subsets have meets and joins, then any finite 
subset has a meet and a join. 

The lattices we will consider have the property that there are no 
infinite chains. Such a lattice has a (unique) least element (that we 
denote by 0;) because the condition that no infinite chains exist 
allows us to find a minimal element m, and any minimal element 
is minimum since if m ¢ a, then m A a would be less than m. 
Similarly, there is a unique largest element 1,. 

For elements a and b of a poset, we say that a covers b and write 
a > b when a > b but there are no elements c so that a > c > b. 
For example when U and W are linear subspaces of a vector space, 
U > W when U D W and dim(U) = dim(W) + 1. Recall that a 
chain in a partially ordered set P is a totally ordered subset of P. 
So a finite chain can be thought of as a sequence ag < ay <-+-+ < Gy. 
A point of a lattice L with miminum element 0; is an element that 
covers Or. 

A geometric lattice is a lattice L that has no infinite chains, and 
such that 


(1) L is atomic, that is, each element of L is the join of points 
of L, and 


306 A Course in Combinatorics 


(2) L is semimodular, that is, if a and b are distinct and both 
cover c in L, then a V 6 covers both a and b. 


Theorem 23.1. The set of flats of a combinatorial geometry, or- 
dered by inclusion, is a geometric lattice. Conversely, given a geo- 
metric lattice L with points X, then (X,{F, : y € L}) is a combi- 
natorial geometry, where 


Fy = 18 6X tas yf: 


PROOF: Since the set F of flats of a combinatorial geometry is 
closed under intersection, the poset of flats is an atomic lattice 
(the meet of two flats is their intersection; the join of two flats is 
the intersection of all flats containing both). Suppose that flats F 
and F5 cover a flat E in a combinatorial geometry. Let x be a point 
in F, \ Fy. There is a flat G2 that covers Fy in F and contains the 
point x. Gy contains F; since otherwise E GS F, G2 & F,. But by 
a symmetric argument, there is a flat G, that covers F; and also 
contains F) and F5. The join F; V Fy must be contained in both G1 
and G2. Since G; covers F;, Fy V Fy = Gy = Go. 

Let L be a geometric lattice and define the flats Fi, as subsets 
of the point set X of DL as in the statement of the theorem. It is 
clear that the empty set, any singleton, and X are flats (when y 
is taken to be Oz, a point of L, or 1z, respectively). Since x < y 
and x < z if and only if x < yAz, we have FyN F, = Fyyz; thus 
F :={F,:y € L} is closed under intersection. No point x not in a 
given flat F,, can be in two flats that cover F',, so it remains only to 
show that some flat that covers Fi, contains x. We show that Fv, 
covers fF, to complete the proof. 

This is equivalent to showing that «Vy > y in L whenever x £ y 
in L and « is a point of L. Choose a maximal chain 


Or <y <yo<ss << YyR=y. 


Since x and y; cover 07, x V y; covers y;. Since x V y; and yy cover 
Yi, LV Yo covers yo (clearly x V y; V yo = XV y2). Inductively, we 
find that x V yr covers Yp. 


In some sense, the difference between geometric lattices and com- 
binatorial geometries is the same as that between incidence struc- 
tures on the one hand and families of subsets of a set on the other. 
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Incidence structures and lattices are more abstract and must be 
used to avoid confusion in certain cases (e.g. when discussing du- 
ality or intervals), but we prefer the language and notation of sets 
and subsets for many arguments. 

For example, for elements a and b of a partially ordered set, 
the interval [a,b] is defined as [a,b] := {x : a <a < bh. Any 
interval in a lattice is again a lattice. The reader should check 
that any interval of a geometric lattice is again a geometric lattice. 
It would be awkward to state the corresponding fact in terms of 
combinatorial geometries. 

In view of Theorem 23.1 (more precisely, the simple correspon- 
dences decribed in the proof), it is often convenient (though some- 
times confusing) to mix the notation and language of combinatorial 
geometries and geometric lattices. For example, we can use the 
same symbol PG;,(F) to denote the combinatorial geometry and 
the corresponding lattice of subspaces of a vector space. 


Problem 23A. (i) Any interval of PG,,(F) is isomorphic as a poset 
to PG,,(F) for some m <n. (ii) PG,(F) is isomorphic to its dual 
poset (where order is reversed). 


Example 23.5. The partition lattices II,,, whose elements are all 
partitions of an n-set X, provide another family of geometric lat- 
tices, and hence give us combinatorial geometries. The partitions 
are ordered by refinement; A is a refinement of 6 if each block of 
B is the union of blocks of A. So the least element (the finest par- 
tition) is all singletons {{x} : x € X} and the greatest element is 
{X}, consisting of a single block. A moment’s thought shows that 
one partition covers another if the former is obtained by coalescing 
two blocks of the latter into one. The points are the partitions that 
consist of a single block of size 2 and n — 2 singletons. 
For example, I, has 15 elements: 


{1234} 
{123,4}, {124,3}, {134,2}, {234,1}, {12,34}, {13,24}, {14,23} 
419, 3,4}, {13,2,4}, {14,9,3}, {23,1,4}, {24.1.3}, 134, 1, 2} 
{1,2,3,4} 
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To simplify the notation, we have written {123,4}, for example, 
for what should be {{1, 2,3}, {4}}. 


Example 23.6. The points of II, are in one-to-one correspondence 
with the edges of K,. Given any simple graph G on n vertices, we 
obtain a geometric lattice L(G) whose points correspond to the 
edge set E(G) of G as follows. The elements of L are to be all 
those partitions A of V(G) such that the subgraph of G induced 
by each block of A is connected. The partitions with this property 
are exactly those that are the join (in II,,) of points (in I) corre- 
sponding to some subset of the edges of G. The lattices L(G) are 
sometimes called the lattices of contractions of G (see Chapter 33). 


Fig. 23.1 is an attempt to illustrate the combinatorial geometries 
L(k4) and L(K;5). L(Uks), for example, has ten points; the dot in 
the figure labeled 12 represents the point (partition) {12,3, 4,5}. 
The partitions of the type {123, 4,5}, for example, contain three 
points and are represented by line segments; lines with two points 
are not shown. (We note that the six point /seven line L(/‘4) is the 
geometry that arises from deleting a point from the Fano configu- 
ration. ) 


Figure 23.1 


For a subset S of the points of a combinatorial geometry, the 
closure S is the intersection of all flats containing S. For example, 
in a linear space, the closure of a two point set is the line containing 


the points. Exercise: show that S C S, A C B => A C B, and 
S=S. It will be occasionally convenient to use the symbol for join 
in the lattice of flats; note that 


BAP SEAL: 
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A subset S C X is independent when for each x € S,x ¢ S \ {x}. 
In Example 23.1, three points in a linear space are independent 
if and only if they are not contained in a line; no four points are 
independent. A set of points in PG,,(F) is independent if and only 
if representative vectors are linearly independent. A set of points 
in AG,,(F) is independent if and only if the vectors are affinely 
independent. One viewpoint of combinatorial geometries is that 
they represent the study of ‘abstract independence’. 

By the rank of a combinatorial geometry with point set X, we 
mean the maximum size of an independent subset of X. PG',(F) 
and AG;,(F) both have rank n+ 1. We will avoid the word di- 
mension, but if we forget and use it, it will mean the rank minus 
one. The maximum size of an independent subset of a flat F’ is the 
rank of that flat. The flats of rank 1 are called points, the flats of 
rank 2 are called lines, the flats of rank 3 are called planes. The 
flats of rank 1 less than rank(X) are called hyperplanes or copoints. 
Occasionally, we need to use the term coline for a flat for which the 
rank equals rank(X) — 2. 

From any combinatorial geometry (X,F), we obtain a linear 
space (X,£) where CL is the set of all lines (flats of rank 2). 

In Fig. 23.1, any two points determine a line that may have two 
or three points; only the latter are drawn in the figure. The planes 
in L(K;) have either four or six points; the latter contain four 
three-point lines. 

We collect some simple facts: 


Lemma 23.2. 


(1) Ife ¢ A but x € AU {y}, then y € AU {x} (the exchange 
axiom). 

(2) If S is an independent set of points in a geometry and x ¢ S, 
then {x} US is independent. 

(3) If F = §, then F = A for any maximal independent subset 
A of S. 


Proor: F, := AU {7} is the flat that covers E := A and contains 
x; Fy := AU {y} is the flat that covers F and contains y. If x € F», 
it must be that FP) = Ff). This proves (1). 

If {x} US is not independent, there is some y € S such that 
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y € (SU{a})\{y}. Let A := S\{y}. Since S is independent, 
y ¢ A. But (1) then implies that x € AU fy}, ie. x € S. This 
proves (2). 

Suppose F = § and that A is an independent subset of S that 
is maximal with respect to that property. If S ¢ A, we could find 
a larger independent subset of S by (2). So S C A and since F is 
the smallest flat containing S, F C A. This proves (3). 


A basis for a flat F is an independent subset B C F' so that 
B =F, i.e. any maximal independent subset of F. As an exercise, 
the reader should check that we could also have defined a basis of 
F as a minimal spanning set, i.e. a subset B of F so that B = F 
and that is minimal with respect to this property. 


Problem 23B. Let G be a connected simple graph. Show that 
the bases of the combinatorial geometry L(G) are exactly the edge 
sets of spanning trees in G. 


Theorem 23.3. All bases of a flat F' in a combinatorial geometry 
have the same finite cardinality (called the rank of F’). For flats E 
and F, 


(23.1) rank(£) + rank(F) > rank(E NF) + rank(E V F). 


PRooF: It should be clear that the condition that there are no 
infinite chains of flats forces all independent sets to be finite. 

If it is not true that all bases of F’ have the same cardinality, 
choose an ordered pair (B,, B) of bases with |B,| > |By| but so 
that |B» \ B,| is as small as possible subject to this constraint. Pick 
x € B,\ By. Then B, \ {x} is independent and has a closure which 
does not contain F' and hence does not contain By. Pick y € By\ B, 
such that y € B, \ {x}. Then (B, \ {x}) U {y} is independent by 
Lemma 23.2 and is contained in a basis B; for F’. Now we have an 
ordered pair (B3, B,) of bases with |B3| > |By|, but By \ Bs is a 
proper subset of By \ By, a contradiction. 

To prove the second part of the theorem, first note that Lemma 
23.2(2) implies that a basis for a flat E can be extended to a basis 
for any flat E’ containing EF. 

Now let B be a basis for ENF’. Extend B to bases B; and Bo for 
FE and F, respectively. Then any flat containing B,U Bz contains E 
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and F and hence EU F;; that is, Bj U Bp = EUF and so B, U By 
contains a basis for E U F.. Then 


rank(E U F) < |B, U Bo| = |By| + | Bo| = |B, a) Bo| 
= rank(£) + rank(F’) —rank(EN F). 


The inequality (23.1) is called the semimodular law. 

We remark that the proof of Theorem 23.3 shows that all maxi- 
mal independent subsets of any subset S, not necessarily a flat, have 
the same cardinality. This is a corollary of the statement for flats, 
however, because we can apply Theorem 23.4 to the subgeometry 
on S. 

As an exercise, make sure that you understand the following: Let 
E and F be flats in a geometry with EF C F. If rank(£) = rank(F), 
then EF = F; F covers E if and only if rank(F’) = rank(£) + 1. 

For the affine geometries AG,,(q), the number of points on a flat 
of rank r is gq". For the projective geometries PG',(q), the number 
of points on a flat of rank r is 


ge >, , 5 
Theorem 23.4. Suppose that every flat of rank i in a combina- 
torial geometry on v points has exactly k; points, 1 = 0,1,...,n. 
Then the total number of flats of rank r is 


(23.3) ec 


Furthermore, the set of points together with the family of rank r 
flats is a 2-design. 


PROOF: Since the flats of rank r+ 1 that contain a flat of rank 
r partition the remaining v — k, points into sets of size k,41 — k,, 
there must be exactly (uv — k,)/(k,+41 — k,) such flats. The formula 
(23.3) now follows by induction on r when we count the ordered 
pairs (£, F’) of rank r and rank r +1 flats with FE C F’. Note that 
kyo = 0 and k; = 1, so that (23.3) is valid for r = 1. 
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Any two points are contained in a unique rank 2 flat. The above 
argument implies that any rank 2 flat is contained in the same 
number of rank r flats. Hence the rank r flats give a 2-design. 


The numbers of rank r flats of PG',(q) are called Gaussian num- 
bers; see Chapter 24. Equations (23.2) and (23.3) imply that the 
numbers of points and hyperplanes of PG',(q) are equal (there are 
other ways of seeing this) and so we have the following corollary. 


Corollary. The points and hyperplanes of the projective geometry 
PG,,(q) form a (v,k, \)-symmetric design with 


Problem 23C. Show that the incidence structure whose points 
are the points of AG',(2) and whose blocks are the planes of AG;,(2) 
is a Steiner system S(3, 4, 2”). 


The following theorem is due to C. Greene (1970). 


Theorem 23.5. The number of hyperplanes in a finite combina- 
torial geometry is at least the number of points. 


PRooF: The proof will be similar to that of Theorem 19.1. First, 
we shall show that if a point x is not on a hyperplane H, then 
the number r, of hyperplanes on «x is at least the number ky of 
points on A by induction on the rank. The assertion is trivial for 
geometries of rank < 2. By the induction hypothesis, we know 
that the number of hyperplanes of the subgeometry on H (i.e. the 
number of colines C contained in H) is at least ky. But for each 
such coline C,, we get a hyperplane (the join of C and x) on z. 
Now we repeat from the proof of Theorem 19.1: Let # denote 
the set of hyperplanes, v := |X|, b:= |H|. Suppose b < v. Then 


Ge Do ae) b— rz) =) aia =a 
2eX Hoa © HeH c¢H 


and this implies that in all the inequalities, equality must hold. 
Therefore v = 6. 
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See the remark following Lemma 23.8 below concerning the case 
of equality in Theorem 23.5. 
In studying PG,,(F), the dimension formula 


dim(U NW) + dim(U + W) = dim(U) + dim(W) 


for linear subspaces of a vector space plays a crucial role. Equiva- 
lently, 


(23.4) rank(#£M F) +rank(E V F’) = rank(F) + rank(F’) 


for flats F and F of PG,(F). This is a stronger version of the 
semimodular law. Indeed, a combinatorial geometry in which (23.4) 
holds is said to be modular. In a modular combinatorial geometry, 
for example, (23.4) implies that any two lines contained in a plane 
must meet nontrivially (cf. Theorem 19.1). 


Problem 23D. Let (X,8) be the linear space consisting of the 
points and lines of a finite modular combinatorial geometry of rank 
3. Show that (X,B8) is either a nearpencil as defined following 
Theorem 19.1 or else is a projective plane as defined in Chapter 
19—that is, show that any two lines have the same cardinality 
n+ 1 and that there are exactly n?-+n-+1 points for some integer 
n> 2. 


We have introduced the projective geometries PG,,(F) and have 
defined projective planes in Chapter 19. Here is the general defi- 
nition: A projective geometry is a modular combinatorial geometry 
that is connected in the sense that the point set cannot be expressed 
as the union of two proper flats. 

We do not have the space or the time to prove here the following 
fundamental result. See Veblen and Young (1907) or Crawley and 
Dilworth (1973) for a proof. 


Theorem 23.6. Every projective geometry of rank n > 4 is iso- 
morphic to PG,(F) for some division ring F. 


Combinatorial geometries of rank < 2 are without great interest, 
but all are modular and, except for the two point line, are projective 
geometries. The rank 3 projective geometries are, by Problem 23D, 
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equivalent to projective planes as introduced in Chapter 19 (the 
nearpencils are not connected). 

Every modular combinatorial geometry can be put together from 
projective geometries. This is the content of the following problem. 


Problem 23E. 

(i) Let (X1, 71) and (X2, F2) be modular combinatorial geome- 
tries; X11. X2 = 0. Show that (X,UX9, {FiUF) : Fi € Fi, Fo € Fo}) 
is a modular combinatorial geometry. 

(ii) Let (X,F) be a modular combinatorial geometry such that 
X is the union of two flats X; and X» where X,;M X» = 0. Let 
FH {POX Fe Fh, 4S 1,2: Show that (XG, 7); 4 =, 2; is 
a modular combinatorial geometry and that F = {F, UF): Fy € 
Fi, Fy € Fo}. 


Problem 23F. Prove that every flat F' of a combinatorial geome- 
try (X,F) has a modular complement, i.e. there exists a flat EF such 
that ENF =0, EV F = X, and rank(£) + rank(F’) = rank(X). 


The linear spaces consisting of the points and lines of modular 
combinatorial geometries satisfy the following condition known as 
the Pasch axiom. A line that meets two sides of a triangle also 
meets the third. More precisely, suppose A, B,C are distinct lines 
and a,b,c distinct points with incidence as in Fig. 23.2. We require 
that any line L that contains a point other than a or b from the 
line C and a point other than a or c from the line B also contains a 
point other than b or c from the line A. See Fig. 23.2. To see that 
this holds in a modular geometry, notice that LZ and A must both 
be contained in the plane P = {a,b,c}, and the modular equation 
implies that two lines in a plane must meet nontrivially. 


Figure 23.2 
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Problem 23G. Consider the incidence structure whose points are 
the k-dimensional subspaces of a vector space V and whose lines 
are the (k + 1)-dimensional subspaces of V, with incidence being 
containment. Show that this incidence structure satisfies the Pasch 
axiom. 


Theorem 23.7. A finite linear space (X,A) for which the Pasch 
axiom holds, consists of the points and lines of some modular com- 
binatorial geometry on X. 


PROOF: Suppose the Pasch axiom holds. We must construct the 
flats of a combinatorial geometry, which we do as follows. We say 
that a subset S C X is a flat when, for any line L, 


IEA S|>2 implies DCS. 


Let F denote the set of all such flats; this includes the empty 
set, all singletons, and X, as well as all lines in A. Check that 
the definition implies that the intersection of any number of flats 
is again a flat. 

Let x be a point not in a flat S. Let T be the union of all 
lines joining x to the points s € S. We claim that this simple 
construction produces a flat 7’, and that T covers S. 

To this end, let D be a line containing two points t,t) of JT. We 
want to show L C T. If t1,t2 both belong to one of the lines on a, 
then L is that line and so L C JT. Otherwise both t; and ty are in 
S; then L CT, so assume t; ¢ S. Then t; and tz belong to distinct 
lines M, and My on x. Say M; meets S in s;, 7 = 1,2, and let N 
be the line joining s; and sy. We have N C S. The Pasch axiom 
guarantees that L meets N in some point z of S. 

Consider any other point ts; € L. Let M3 be the line joining x 
and t3. Since M3 meets two sides of the triangle with sides M,, 
L, and N (with vertices z, t;, and s,), it must meet the third side 
N in some point s3 of S. Then Ms is one of the lines joining x to 
points of S, so t3 € T. We have now proved that T is a flat. 

That T covers S is easy to see: If U is a flat, S S UST pick 
y € U\S. By construction, the line joining x and y is one of the 
lines on x that meets S; that line contains two points of U and 
hence is contained in U. This means x € U, whence T CU. 
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Since any point lies in a flat that covers a flat S, (X,F) isa 
combinatorial geometry. (The hypothesis that X is finite is only 
used here to ensure that there are no infinite chains of flats.) 

Let x be a point on a line LZ and let H be any hyperplane. Since 
(if x is not already in H) the join of H and x must be X, L must 
be one of the lines on x that meet H. That is, any line L and 
any hyperplane H meet nontrivially. This implies modularity by 
Lemma, 23.8 below. 


Lemma 23.8. A combinatorial geometry is modular if and only if 
any line and any hyperplane meet nontrivially. 


ProoF: The modular law (23.4) shows that any rank 2 flat and 
any rank n—1 flat in a rank n geometry meet in a flat of rank > 1. 

For the converse, we use induction on the rank. Assume that 
all lines and hyperplanes in a rank n combinatorial geometry meet 
nontrivially. Suppose H is a hyperplane and that there exists a line 
L CH and arank n-— 2 flat C C A that are disjoint. But then for 
any point x ¢ H, the hyperplane C V {x} would be disjoint from 
the line L. So by induction, the subgeometry on any hyperplane H 
is modular. 

Thus if there is a pair of flats EF and F' that provide a coun- 
terexample to the modular law (23.4), then EV F is equal to the 
entire point set X. Let H be a hyperplane containing F and let 
F’ := HOF. Then EV F" C H. But rank(F”) > rank(F’) — 1 since 
otherwise a modular complement of F” in F would have rank > 2 
and so would contain a line, one that would be disjoint from H. We 
see that EF and F” provide a counterexample to (23.4) contained in 
H, contradicting the modularity of the subgeometry H. 


Remark. Suppose the number of hyperplanes of a finite combi- 
natorial geometry (X,F) is equal to the number of points. Then, 
reviewing the proof of Theorem 23.5, r; = ky whenever x ¢ H. If 
we apply Theorem 23.5 to the interval |{z}, X] in any combinato- 
rial geometry, we find that r, > 1,, where |, denotes the number of 
lines containing x (since these lines are the points of the geomet- 
ric lattice [{x}, X]). For « ¢ H we obviously have |, > ky since 
the lines {x} V {y}, y € H, are distinct. In summary, equality in 
Theorem 23.5 implies that 1, = ky when x ¢ H. This means that 
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every line meets every hyperplane and hence (X,F) is modular by 
Lemma 23.8. 


Two triangles {a1, b1,c,} and {ag, by, co} are said to be perspective 
from a point if there exists a point p (the point of perspectivity) such 
that {p, a1, a2} are collinear, {p, bi, b2} are collinear, and {p, c1, co} 
are collinear. Two triangles {a1,b1,c;} and {dg, bo, c2} are said to 
be perspective from a line if there exists a line L (the line or azis 
of perspectivity) such that {L, Aj, Az} are concurrent, {L, By, Bo} 
are concurrent, and {L,C;, C2} are concurrent. Here we are using 
‘triangle’ to mean three noncollinear points, and A;, B;, and C; to 
denote the ‘sides’ of the triangle opposite a;, b;, and c;, respectively. 
That is, A; := {b;,¢;}, 4 = 1,2, ete. 


Figure 23.3 


In Fig. 23.3, the triangles are perspective from the point p and 
also perspective from the line LZ. The incidence stucture so illus- 
trated, with ten points and ten lines, each line on three points and 
each point on three lines, is called the Desargues configuration. The 
triangles need not lie in a plane. The right diagram in Fig. 23.1 
is isomorphic (!) to that in Fig. 23.3 as an incidence structure; 
the former is easier to imagine as 3-dimensional. It can be viewed 
in many ways, e.g. as showing that the triangles {25, 35,45} and 
{12, 13, 14} are perspective from the point 15 and also perspective 
from the line {23, 24, 34}. 

If we choose a basis for the (n+1)-dimensional vector space whose 
subspaces comprise PG,,(F), we can describe projective points by 
so-called homogeneous coordinates. By (#0,%1,...,2%n), we mean 
the 1-dimensional subspace spanned by the vector (%0,%1,.--,Zn). 
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The projective point (x0, 71,..-,2%n) is the same as (yo, Y1,---5 Yn) 
if and only if the vector (Yo, y1,---, Yn) is a nonzero scalar multiple 
of (2%, 2%1,---,;%n). The hyperplanes may also be described by ho- 
mogeneous (n+1)-tuples: [co, c1,..., Cn] will denote the hyperplane 
consisting of points with homogeneous coordinates (x0, 21,.--,2n) 
so that cot) + 41C¢, +--+ + 2p, = 0. 


Theorem 23.9 (Desargues’ theorem). In PG,,(F), if two tri- 
angles are perspective from a point, then they are perspective from 
a line. 


ProorF: Let triangles {a1, b;,¢,} and {ag, bo, co} be perspective from 
a point p. We are going to assume that no three of p, a,, b; and 
c, are collinear so that we do not spend too much time on details. 
We will also assume that all points lie in a plane, i.e. we prove the 
theorem for PG2(F). The other cases are left for the reader. 

We choose a basis for a 3-dimensional vector space so that the 
basis vectors x,y,z span the 1-dimensional subspaces aj, 61, and 
C1, respectively. Then the homogeneous coordinates for a;, b;, and 
c, are (1,0,0), (0,1,0), and (0,0,1), respectively. Then p has ho- 
mogeneous coordinates (a, 3,7) where all coordinates are nonzero. 
We replace the original basis vectors by ax, Gy and yz; then ay, 
bi, and c, are still represented by (1,0,0), (0,1,0), and (0,0, 1), re- 
spectively, while p now has homogeneous coordinates (1, 1,1). This 
will simplify our computations. 

It follows that aj = (a, 1,1), by = (1, 8,1), co = (1,1,7), for some 
a,?,y € F. The line joining a; and 6, is [0,0,1]; the line joining 
ag and by is [1 — 6,1 — a,aZ — 1] (check that both a» and be are 
on this line). The point on both lines is (1 — 8,a— 1,0). Similarly, 
the point of intersection of the lines joining b; and c, with the line 
joining b2 and cp is found to be (0,1 — a,y — 1), and the point of 
intersection of the lines joining c; and a; with the line joining cz and 
ag is found to be (3 — 1,0,1—y). The three points of intersection 
are collinear since their coordinates are linearly dependent. 

(If we are careful, the proof should go through also for division 
rings F.) 


Problem 23H. Prove that the linear space of PG,,(F) also satis- 
fies Pappus’ theorem when F is a (commutative) field, which is as 
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follows. Let a;, b;, and c; be collinear points, say on a line L;, for 
i = 1,2. Suppose L; and Lz meet in a point (not one of the six 
named points). Then the lines {a1, bz} and {a2, b;} meet in a point 
c, the lines {b1,c2} and {b2,c,} meet in a point a, and the lines 
{a1,¢c2} and {a2,ci} meet in a point b; and the three points a, b,c 
are collinear. 


Figure 23.4 


Projective planes in which the statement of Theorem 23.9 is valid 
(if two triangles are perspective from a point, then they are per- 
spective from a line) are called Desarguesian planes. Projective 
planes in which the conclusion of Problem 23H holds are called 
Pappian planes. We do not have the space or time to prove the 
further following fundamental result. See Crawley and Dilworth 
(1973). 


Theorem 23.10. 


(1) A projective geometry of rank 3 is isomorphic to PG(E) for 
some division ring E if and only if it is a Desarguesian plane. 

(2) A projective geometry of rank 3 is isomorphic to PG2(F) for 
some field F if and only if it is a Pappian plane. 


A corollary of Theorem 23.10 is that every Pappian plane is De- 
sarguesian. For a synthetic (i.e. coordinate-free) proof that Pap- 
pus’ theorem implies Desargues’ theorem, see D. Pedoe (1963). We 
remark that every finite division ring is a field (Wedderburn’s the- 
orem), so that every finite Desarguesian plane is Pappian; see M. 
Hall, Jr. (1972). 

We describe one method of obtaining finite projective planes that 
are not Desarguesian. 
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Let G be an abelian group of order n?, and suppose that there 
exist subgroups Ho, H,,...,H, of order n that partition the nonzero 
elements of G, that is such that H; H; = {0} for i # 7 and, 
therefore, 


LH \ {0} = G \ {0}. 
i=0 


Then, we claim, the incidence structure whose points are the el- 
ements of G and whose lines are all cosets (translates) of all the 
subgroups H; is an affine plane. In general, the intersection of cosets 
of two subgroups H and K is either empty or a coset of HN K, 
so lines meet in at most one point. We could now count covered 
pairs to show that every two distinct points are contained in a line, 
but it is just as simple to note that given points x, y € G, the coset 
Ay + y contains both points where Hy is the subgroup containing 
the difference x — y. An affine plane obtained from a group G and 
n+ 1 subgroups in this way is called an affine translation plane. It 
is known that G must be elementary abelian for such subgroups to 
exist; see J. André (1954). The affine plane may be completed to a 
projective plane as in Problem 19K; any projective plane obtained 
in this way is a translation plane. 


Example 23.7. Here is a partition of the nonzero elements of F# 


into ten subgroups Ho, H,..., Ho. (We have denoted the elements 
as strings, dropping commas and parentheses.) 


£0000, 1000, 2000, 0100, 0200, 1100, 2200, 2100, 1200} 
£0000, 0010, 0020, 0001, 0002, 0011, 0022, 0021, 0012} 
£0000, 1010, 2020, 0101, 0202, 1111, 2222, 2121, 1212} 
£0000, 2010, 1020, 0201, 0102, 2211, 1122, 1221, 2112} 
£0000, 0110, 0220, 2001, 1002, 2111, 1222, 2221, 1112} 
£0000, 0210, 0120, 1001, 2002, 1211, 2122, 1121, 2212} ° 
£0000, 1110, 2220, 2101, 1202, 0211, 0122, 1021, 2012} 
£0000, 2210, 1120, 1201, 2102, 0111, 0222, 2021, 1012} 
£0000, 2110, 1220, 2201, 1102, 1011, 2022, 0121, 0212} 
£0000, 1210, 2120, 1101, 2202, 2011, 1022, 0221, 0112} 


The affine plane corresponding to this ‘spread’ of subgroups is the 
Desarguesian plane. 
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But note that the elements of the first four subgroups, for exam- 
ple, can be partitioned into subgroups in another way, as indicated 
by the columns below. 


{0000, {0000, {0000, {0000, 
1000,2000, 0100,0200, 1100,2200, 2100, 1200, 
0010,0020, 0001,0002, 0011,0022, 0021,0012, 
1010, 2020, 0101,0202, 1111,2222, 2121, 1212, 
2010, 1020} 0201,0102} 2211,1122} 1221,2112} 


The above example is the special case gq = 3 of a construction we 
now describe. Let V be a 2-dimensional vector space over Fj2 and 
let Ho, M,,...,H 2 be the 1-dimensional subspaces over Fj2. If we 
consider V as a 4-dimensional vector space over F,, the subspaces 
H,; are 2-dimensional over F, (they form a spread of lines in PG'3(q); 
see Theorem 24.3). Let U be any 2-dimensional subspace over F, 
other than one of the H;,’s. Then U meets any H; in {0} or ina 
1-dimensional subspace over F,. Say U meets Ho, Mi,..., Hy in q 
points each and Hy41, Hg42,...,H in the zero vector only. Con- 
sider the F,-subspaces aU as a ranges over the nonzero elements of 
F,2; since multiplying by the nonzero elements of F, fixes U, there 
are only (q?—1)/(q—1) distinct such subspaces, say Up, Ui,..., U4. 
These ¢+ 1 subspaces U; meet pairwise only in the zero vector and 
their union is Hj) UH, U---U Ay. Thus 


Uo, U1, cn) Ug, Agai, Ho+2, oo gle 


is another partition of V into subgroups of order q?. 

We shall not prove here that the projective planes resulting from 
this construction are not Desarguesian (for g > 2). See Theorem 
10.9 in D. R. Hughes and F. C. Piper (1973). The reader can find 
counterexamples to Desargues’ theorem in Example 23.7 by hand. 

We conclude the chapter with proof of an important step in the 
difficult proof of Theorem 23.6. We need a preliminary observation. 


Proposition 23.11. The subgeometry on any flat of a projective 
geometry is again a projective geometry. 


PROOF: That the modular law holds in the subgeometry on a flat 
of a modular projective geometry is immediate. We must show that 
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these subgeometries are connected. To this end, it will suffice to 
show that if H is a hyperplane in a projective geometry with point 
set X, then the subgeometry on H is connected. 

If not, H = EUF where E and F are flats. By (23.4), rank(£)+ 
rank(F’) = n—1 where n := rank(X). Let £,,...,H, and Fi,..., Fi 
be the flats that cover FE and F’,, respectively, that contain points 
outside of H; each family partitions the points outside of H. Sup- 
pose that both s,t > 2. By (23.4), there exists 7; € FE; Fj, i= 1,2. 
The line {21,22}, again by (23.4), meets H in some point e € E. 
But then x, being on the line joining e and x;, must be contained 
in EF), a contradiction. Thus one of s or t must be 1, say s = 1, and 
then X is the union of the disjoint flats F and FE), contradicting 
the connectivity of the original projective geometry. 


A corollary of Proposition 23.11 is that every line of a projective 
geometry has at least three points (since a two-point line is not 
connected). This is used at a critical point in the proof below. We 
remark that with the aid of Problem 23D, it is easy to see that all 
lines in a finite projective geometry have the same size n + 1. 


Theorem 23.12. Desargues’ theorem holds in any projective ge- 
ometry of rank > 4. 


PROOF: Consider two triangles {a,, b,c} and {dg, bz, co} that are 
perspective from a point x. 

We first assume that the planes P; and P: spanned by {a1, bi, c1} 
and {de, b2,c2}, respectively, are distinct. Let T := {x,a1,b1,c1}. 
The lines {7,a;}, {v,b;}, and {z,c,} contain, respectively, the 
points ag, be, and cy, so P,; and P» are contained in T, that evi- 
dently has rank 4. So by the modular law, P, and P) must meet in 
a line L. We claim that the two original triangles are perspective 
from the line L. 

The plane Q := {p,a,};} contains aj and by and hence contains 


both the lines {a 1, b;} and {ag, by} that therefore intersect in a point 
q, say. The point g belongs to both P,; and P 2, and hence q € L. 
Similarly, the lines {b;,c;}, i = 1,2, meet in a point on L; and the 


lines {a;,c;}, 7 = 1,2, meet in a point on L. The two triangles are 
thus perspective from L. 
Now assume that the two original triangles lie in a plane P. 
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Let x; be any point not in P and x2 any other point on the line 
containing x; and p. The lines {x),a,} and {x2, a2} are contained 
in the plane {p,x1,a;} and so meet in a point a*. Similarly, the 
lines {21,b;} and {22, bo} meet in a point b* and the lines {21,1} 
and {x2,c2} meet in a point c*. The plane P* := {a*, b*,c*} and P 
are contained in a rank 4 flat and so meet in a line D:= P/N P*. 
The triangles {a1, b1,c,} and {a*,b*,c*} are perspective from the 
point x; and lie in different planes, and so are perspective from the 
line L. Similarly, the triangles {a2, b2,co} and {a*,b*,c*} are per- 
spective from x2, and so are perspective from L. The line {aj, bi} 
meets L in the same point as the line {a*, b*}; the line {a2, b2} meets 
L in the same point as the line {a*, b*}; hence the lines {a 1, b;} and 
{a2,b2} meet in a point of L. Similarly, the other corresponding 
‘sides’ of the two original triangles meet in points of L. In summary, 
the original triangles are perspective from the line L. 


Problem 231. Determine the number of flats of rank n — k in the 
partition lattice II). 


Problem 23J. For an arbitrary block design (with parameters 
v, k, A, b,r) we define a line to be the intersection of all blocks con- 
taining a pair of points x,y. Prove: 

(i) Any two points are on a unique line; 

(ii) If L is a line, then 2 < |L| < ©; 

(iii) If a line meets a block in two points, it is contained in the 
block. 


Notes. 

Girard Desargues (1593-1662) was an architect and military engi- 
neer who introduced the term involution in his work on perspective. 
He worked in Lyons and Paris. The work of Pappus of Alexandria 
(fl. 320), the last significant Greek geometer, was considerably ear- 
lier. The mathematical discipline of projective geometry did not 
develop until the 19th century. 

Moritz Pasch was originally an algebraist but then became inter- 
ested in non-euclidean geometry. He came to Giessen at the age of 
27 as Privatdozent and stayed there until his death (60 years later). 
He was Rektor of the university 1893-1894. 
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An interesting family of combinatorial geometries called transver- 
sal geometries arises from bipartite graphs; see Crapo and Rota 
(1970). The independent sets of these geometries are those subsets 
of one part X of a bipartition (X,Y) with the property that they 
can be matched into Y. 

We have seen that the concept of combinatorial geometries and 
geometric lattices are ‘cryptomorphic’, i.e. they are essentially dif- 
ferent axiom systems for the same structures. Crapo and Rota 
(1970) give further cryptomorphic versions. For example, ‘ma- 
troids’ are more-or-less the same thing as combinatorial geometries, 
though the emphasis of the theory of matroids is different. 

Some characterizations of the symmetric design consisting of the 
points and hyperplanes of a finite projective space may be found 
in Section 2.1 of P. Dembowski (1968). 
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24 


Gaussian numbers and 
qg-analogues 


There exist many analogies between the partially ordered set of all 
subsets of a finite set and the partially ordered set of all subspaces 
of a finite vector space. This is primarily because they are both 
examples of ‘matroid designs’ as defined in the previous chapter. 
Let V,(q) denote an n-dimensional vector space over the field F, 
of q elements. We use the term k-subspace as a short form of k- 
dimensional subspace. 

We begin with some counting. To obtain a maximal chain (i.e. 
a chain of size n + 1 containing one subspace of each possible di- 
mension) in the partially ordered set of all subspaces of V,,(q), we 
start with the 0-subspace. After we have chosen an i-subspace U;, 
0 <i <n, we can choose an (i + 1)-subspace U;,; that contains 
U; in (q” — q')/(q’** — @') ways since we can take the span of U; 
and any of the (q” —q') vectors not in U;—but any (i+ 1)-subspace 
will arise exactly (q’*! — q') times in this manner. In summary, the 
number of maximal chains of subspaces in V,,(q) is 


eget gee ees gee Iga) 


Reo : q=1)" 


We may consider M(n,q) as a polynomial in q for each integer 
n. When the indeterminate q is replaced by a prime power, we 
have the number of maximal chains in the poset PG',(q). When 
q is replaced by 1, we have M(n,1) = n!, which is the number of 
maximal chains in the poset of subsets of an n-set. 

The Gaussian number i] can be defined as the number of k- 


subspaces of V,,(q). These are called Gaussian coefficients by some 
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authors to emphasize the analogy with binomial coefficients. To 
find an expression for Velie we count the number N of pairs (U,C) 
where U is a k-subspace and C is a maximal chain that contains 
U. Of course, every maximal chain contains exactly one subspace 
of dimension k, so N = M(n,q). On the other hand, we get each 
such maximal chain uniquely by appending to a maximal chain 
in the partially ordered set of all subspaces of U, of which there 
are M(k,q), a maximal chain in the partially ordered set of all 
subspaces of V,,(q) that contain U; there are M(n — k,q) of these, 
since the partially ordered set {W : U CW CV} is isomorphic to 
the partially ordered set of subspaces of the factor space V/U that 
has dimension n — k. Thus 


A = M(n,q) (q” — 1)(q""1= 1)- +: (g-F#= 1) 
q M(k,q)M(n—k,q) (g¥ — 1)(qF!— 1)---(¢—-D) 


i a3 


For some purposes, it is better to think of Lila as a polynomial 
in an indeterminate qg rather than as a function of a prime power 
q. That the rational function above is in fact a polynomial can 
be seen in several ways. For example, it is an easy exercise to see 
that a rational function in x which is integral for infinitely many 
integral values of x must be a polynomial in x. Perhaps Gaussian 
polynomial is a better term than Gaussian number or coefficient. 
As an example, 


6 : 
i = 9 + q+ 2q' + 39° + 3q° + 3q4 + 3q? + 2¢? +g 41. 
q 


n 


When the indeterminate q is replaced by 1 in [ a we obtain 


ae This explains a small part of a tendency for results concerning 
finite vector spaces to reduce to the corresponding results for sets 
when q is replaced by 1. It is also possible to have so-called q- 
analoques of results on sets where we try to replace ‘k-subset’ by 
‘k-subspace’. Sometimes these statements are true and have proofs 
that are similar to the results on sets. 

The following is the g-analogue of Sperner’s theorem, Theorem 
6.3: 


24. Gaussian numbers and q-analogues o2T 


Theorem 24.1. If A is an antichain in the partially ordered set 
of all subspaces of V,,(q), then 


sa el 


PROOF: Let A be an antichain and count the number N of pairs 
(U,C) where U € A and C is a maximal chain that contains U. 
Every maximal chain contains at most one subspace in A, so N < 
M(n,q). On the other hand, each k-subspace in A lies in exactly 
M(k,q)M(n —k,q) maximal chains C. Thus 


M(n,q) > N = )_cxM(k,q)M(n—k,q), 
k=0 


where c;, is the number of k-dimensional subspaces belonging to 
A. The proof is completed in a manner analogous to the proof of 


Theorem 6.3 if we believe that lila < lea for all k, and this is 
q 


left to the reader to verify. 


The following theorem gives a combinatorial interpretation of the 


coefficients of the lila as a polynomial in qg, and thus proves they 
are all positive integers. 
Theorem 24.2. Let 


k(n—k) 
7 £ 
Gls 1g wate 


Then the coefficient az is the number of partitions of € whose Ferrers 
diagrams fit in a box of sizek x n—k. 


PROOF: We can work with the vector space of n-tuples over 
f,. It is well known that every k-subspace of F/’ arises uniquely as 
the rowspace of a k x n matrix over F, that (i) has rank k and 
(ii) is a so-called row-reduced echelon form. This means that the 
first nonzero entry in each row is a 1, the entries above a leading 1 
are 0’s, and the leading 1 in row 2 is further to the right than the 
leading 1 in rowi—1,i=2,3,...,k. 


yn 
q 
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Suppose the leading 1 in row 7 occurs in column c;, 2 = 1,2,...,k. 
Then (n—k+1—c1,n—k+2-—c,...,m2—1—cp_1,n —cg) is a non- 
increasing sequence of nonnegative numbers and so, when terminal 
zeros are dropped, corresponds to a partition of some number into 
at most k parts of size at most n — k. Conversely, such a partition 
gives us the positions of leading ones in a class of echelon forms. 

For example, there are 20 classes of echelon forms in the case 
n = 6, k = 3; see Fig. 24.1. 


100¢c¢ee lOe0ee l1Ocee(le 10<e«ee(0 leQ0Oee 
fososes| fissoee] fisseoe| foseeeo] fotos] 
O0Oleece O00lee 0O000l1e 000001 O00lee 
leOdeODe leQeedl lee0d0e lee(lel leeel0 
fooreos | fone] fosoio. | foo] [oso] 
0O0001e 000001 O0001e 000001 000001 
O100¢e6« O10e0e O1l0ee0 Ole0d0e Ole0eD 
fooroee| foossoe | foie] fovoio. | [ovo] 
O00lee 0O000l1e 000001 0O0001e 000001 
OleeI0 001006¢ 0010¢0 001¢00 000100 
fooo00] fovoro. | foo eo] fov000] fooo00] 
000001 0O000l1e 000001 000001 000001 
Figure 24.1 


In each matrix of Fig. 24.1, the positions of the dots, when the 
intervening columns are deleted and we reflect through a vertical 
axis, describe the Ferrers diagram of a partition (of some number 
<9) that fits in a 3 by 3 box. The class represented by the last 
matrix in the first row, for example, contains q’ echelon forms. 

In general, the class of echelon forms where the leading 1 in row 
i occurs in column c;, i = 1,2,...,k, contains g’ matrices for some 
£ since the positions not containing 1’s or required to be 0’s may 
be filled arbitrarily with elements of F,. To be precise, 


£=(n—k4+1-—c)+---+(n—-—1—-cg_1) + (n— cg) 


because for each i = 1,2,...,k, there are n — (k — i) — c positions 
in the i-th row that may be arbitrarily filled. That is, the class 
consists of g’ matrices where the partition into at most k parts of 
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size at most n—k that corresponds to the class is in fact a partition 
of the number @. 
Thus when ay is defined as the number of partitions of 4 whose 
Ferrers diagrams fit in a box of size k x n—k, we have a polynomial 
jee arg’ that agrees with the polynomial ila when evaluated 


at any prime power q; hence the two polynomials are equal. 


We remark that when we set g = 1 in the statement of Theorem 
24.2, we obtain as a corollary the result that the total number of 
partitions whose Ferrers diagram fits in a box of size k x n—k is 
(i): 

Problem 24A. Show this directly. 


Next, we derive a recursion (24.1) for the Gaussian numbers. 
This provides another way to see that they are polynomials in q 
(by induction on n, say). Pick a hyperplane H, i.e. an (n — 1)- 
subspace, of V,(q). Some k-subspaces are contained in H (their 


number is a) and the rest meet H in a (k — 1)-subspace. Each 


of the au (k — 1)-subspaces in H is contained in 


n—-k+1 Gees 
i eed 


k-subspaces of V, of which 
|" — = gee 
1 4 q-1 


are contained in H; this leaves g”~* that are not contained in H. 
Thus 


ay GhePy refi, 


Problem 24B. Determine the exponents e; (they may be func- 
tions of m, n, and & as well as 7) so that the following identity is 


valid: ; 
Paleo rier 


q 
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(One way to solve this involves echelon forms; another uses (24.1).) 


The equation of Problem 24B is a g-analoque of the equation 


C2)-EOC) 


for binomial coefficients. The g-analogue of inclusion-exclusion ap- 
pears in the next chapter; see Theorem 25.2 for an application. 

What is the q-analogue of (simple) t-designs? This would be 
a family B of k-subspaces of V,(q) such that each t-subspace is 
contained in exactly 4 members of 6. No nontrivial examples of 
such ‘q-S)(t,k,v)’s’ were known with t > 2 until 1986 when S. 
Thomas described a family with 


GSa2, ASG tH=2. Bas. SS ly fiod:6): 


The case t = \ = 1 is already nontrivial for vector spaces. For 
sets, an S(1,k,v) is a partition of a v-set into k-subsets, and this 
exists if and only if & divides v. 


Theorem 24.3. There exists a family of k-subspaces of V,,(q) such 
that every 1-subspace is contained in exactly one member of the 
family (a so-called spread of k-subspaces) if and only if k divides v. 


PROOF: In other words, we are interested in families of k-subspaces 
such that any two of them intersect in the 0-subspace but such that 
their union is the entire v-dimensional vector space. 

The number of k-subspaces that we require is the total number 
of nonzero vectors divided by the number of nonzero vectors in a 
k-subspace, and this is (q’ — 1)/(q* — 1). This is an integer if and 
only if k divides v. 

Suppose v = km where m is an integer. As a v-dimensional 
vector space over F,, we take an m-dimensional vector space V 
over Fx. Let B be the family of 1-subspaces of V as a vector space 
over Fx. (So there are ghm-D) 4 gk(m—2) 4....4 gk + 1 of these and 
each contains g* vectors.) Now think of V as a vector space over 
F,, a subfield of Fj.; the members of B are k-dimensional subspaces 
over F, and provide the required partition. 
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Problem 24C. Let B be a spread of k-subspaces in V,(q). Let A 
consist of all cosets of all members of B. Show that A is the block 
set of an S(2,q*,q’) on the set of points V,(q). (The case v = 2k 
is particularly interesting because we obtain affine planes of order 
q*. Planes obtained from spreads are called translation planes.) 


Problem 24D. Prove that 


n 


Sig? =1)...(qr tt — 1)q(2) rl ge 


k=0 


The Erdés-Ko-Rado Theorem (see Theorem 6.4) can be formu- 
lated in two (equivalent) ways. In one formulation, the bound for 
m is (—) and in the other it is k . (aE where G) counts the total 
number of k-subsets of the n-set. 

Now consider a possible g-analogue. Start with a collection A := 
{Aji, Ao,..., Am} of m distinct k-dimensional subspaces of the n- 
dimensional vectorspace V(n,q) over F, such that any two of the 
A; intersect in a subspace of dimension > 1. Then one can either 
conjecture that if k < 5n 


m< wae or ee iM : 
k-1 n |k 
q q 
In this case, the bounds differ. The first one is stronger than the 
second one. For a proof of the first bound see W. N. Hsieh (1975). 
This result was strengthened by P. Frankl and R. M. Wilson (1986). 


The second bound can be proved by generalizing the proof of The- 
orem 6.4 which is the next problem. 


Problem 24E. Consider the situation described above. Let 7 := 
(X1,X2,---,X,) run through the set B,, of all ordered bases of V (n, q). 
We place 7 on a circle. If a sequence of k consecutive vectors x; 
(on the circle) is a basis for Aj, we say A; € 7. 

(i) Show that for a fixed 7, we have |AN a| < k. 

(ii) For a fixed A;, count the number of bases 7 € B,, such that 
A; € TT. 

(iii) Prove that 
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Notes. 
It has been suggested that one should use the notation n? for 
M(n,¢@) (which is to be pronounced ‘n q-torial’). Then we have 


Mice 


Also note 
Dan? =n! (that is, n!). 
We do not know who first proposed this. 

The qg-analogue of Ramsey’s theorem was a long outstanding 
problem until it was solved by R. L. Graham, K. Leeb, and B. 
L. Rothschild in 1972. 

Car] Friedrich Gauss (1777-1855) was possibly the greatest math- 
ematician (scientist) of all time. He made extremely important 
contributions to number theory. He was the first to study proper- 
ties of the expressions that are now called Gaussian polynomials or 
Gaussian numbers. 
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Lattices and Mobius inversion 


One of the techniques belonging to the foundations of combinatorics 
is the principle of Mobius inversion over partially ordered sets. This 
can be thought of as a generalization of inclusion-exclusion as well 
as an extension of the inversion with the classical Mobius function 
of number theory, which was discussed in Chapter 10. 

Let P be a finite partially ordered set. We will consider matrices 
a whose rows and columns are indexed by the elements of P, that 
is mappings from P x P to the rationals or complex numbers. The 
incidence algebra A(P) consists of all matrices a such that a(x, y) = 
0 unless x < y in P. By the definition of matrix multiplication, 


(a8)(x,y) = >> a(a, z) A(z, y). 


zEP 


If a, 3 € A(P), then the above sum need be extended over only 
those z in the interval |x, y] := {x < z < y}. It is easily seen that 
A(P) is closed under multiplication, as well as addition and scalar 
multiplication. 

An element of A(P) that will play an important role in the fol- 
lowing theory is ¢ (the zeta function of P) which is defined by 


Meare iy Gia gee, 
ae 0 otherwise. 
We claim that ¢ is invertible and that its inverse, which is called 
the Mobius function of P and is denoted by yp, is integral and lies 
in A(P). 

This is simple to verify. The equation w¢ = I (the identity) 
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requires that 


(25.1) Ss" ez) = { aoe 


ze O otherwise, 


and this can be ensured by simply defining ys inductively, by declar- 
ine ula). aay) Oe ayy anid 
(25.2) p(x, y) = — S- pes), fora<y in P. 


wsz<y 


For example, when P = {1,2,3,4,6,12}, the lattice of all (positive) 
divisors of the integer 12, 


i > ak tie Sp. i a a 
0 4, A Of. a. ap Saas cen “4 
OO. 20. 44 OG: oh. Ub. Sah 26 
Ce) tt ot Pea oe OE Oe 
Gao ie! dort a ane a ae rr 
G50 0" 0. Or 4 0 0 0 0 0 1 


The top row of ~ was computed as follows: 


w(1,1) = +1, 
w(1,2) = —p,1) = -1, 
w(1,3) = —w0, 1) = -1, 
w(1,4) = —(1, 1) — w(1, 2) = 0, 
w(1,6) = —w(1, 1) — w(1, 2) — v1, 8) = +1, 
w(1, 12) = —w(1, 1) — #1, 2) — 41,3) — w(1, 4) — w(, 6) = 0. 


We remark that a somewhat different equation concerning ju re- 
sults if we consider instead the relation Cu = I: 


(25.3) we ={ 5 other 


rwise. 
foe 0 otherwise 


Another much more complicated way of looking at the matter 
of the invertibility of ¢ is as follows. We first remark that a finite 
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partial order can always be ‘dominated’ by a total order; that is, 
there exists an indexing P = {21,%2,...,%n} such that 2; < x; in 
P implies that 7 < 7. (Proof: Let x be any maximal element of P, 
index P\{x} by induction, and put x at the end.) With respect to 
such an indexing of P, the matrices in A(P) are upper triangular. 
The zeta function has 1’s on the diagonal and so has determinant 1 
and hence an integral inverse by Cramer’s formula. Then ¢~! = pu 
lies in A(P) because the inverse of any matrix (or element of a 
finite-dimensional commutative algebra) is a polynomial in that 
matrix (or element). 

The equation (25.2) implies that u(x, y) = —lif x < y. Also note 
that p(x,y) = k — 1 if the interval [x,y] is a k-point line, i.e. the 
interval consists of x, y, and k pairwise noncomparable elements 
Z1,.--,Zk, each with x < 2; < y. 

In the following theorem, we list the values of the Mobius function 
of some common partially ordered sets. The proof of Theorem 25.1 
will not be given until the end of the chapter because we wish to 
talk about the implications of Mobius inversion first. Clever and/or 
industrious readers will be able to prove some parts of Theorem 
25.1 with the recursion (25.2) and simple induction arguments, but 
we prefer to wait until we have Weisner’s theorem, Theorem 25.3, 
available. We do not prove part (v) here—see the notes. 


Theorem 25.1. (i) For the lattice of all subsets of an n-set X, 


ph ae 
ok, = 0 otherwise. 


(ii) For the lattice of all (positive) divisors of an integer n, 


nab) = { 


(That is, (a,b) = (2) where the latter p is the classical function 
of a single variable in (10.8).) 


(—1)" if 4 is the product of r distinct primes, 


0 otherwise, i.e. if a {b or b is not squarefree. 


(iii) For the lattice of all subspaces of a finite-dimensional vector 
space V over the field F, of q elements, 


(—1)'g(2) ifU CW and dim(W) — dim(U) = k, 


ace) =| 0 ifU LW. 


336 A Course in Combinatorics 
(iv) For the lattice IL, of all partitions of an n-set X, 
(A,B) = (-1)4-8 TT] (np — 0! 
BeB 


for elements A and B of I], with A < Bb, where ng denotes the 
number of blocks of A that are contained in a block B of B. 


(v) For the lattice of faces of a convex polytope, 


(S1)o ae) ifA Cc B, 


otherwise. 


nA, B) = { 


We now state the principle of Mobius inversion. Let P be a finite 
partially ordered set and ps its Mobius function. Let f,g,h: P—R 
(or into any additive group) be functions such that the relations 


g(t) = D0 f(a) and A(x) = D7 f(b) 


aia<x b:b>a 


hold for all x € P. Then, for all x € P, 


(25.4) f(x) = Sy wla,2)g(a), 


and 
(25.5) f(a) =D) u(x, bh). 
b:b>a 


These are readily verified by direct substitution. For example, the 
right-hand side of (25.4) is 


X wee (709) =H 10H wea), 
aia<x b:b<a b:b<ax a:b<a<ax 


But the inner sum on the right is zero by (25.3) unless b = x, so only 
the term f(x) survives. Or, in matrix notation (when we regard 
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f, g, and h as row or column vectors—whichever is appropriate— 
whose coordinates are indexed by P), the relations assumed for g 
and h are equivalent to g = Cf, so f = wg, and h= f¢, so f = hu. 

When the principle of Mobius inversion is applied to the lattice of 
subsets of a set X, we recover inclusion-exclusion. Let (A; : 7 € J) 
be a finite family of subsets of a finite set X. For J C J, let f(J) 
equal the number of elements of X that belong to exactly the sets 
A; with j € J and to no others. (Think of the size of one of the 
regions in a Venn diagram of the sets.) Let g(J) equal the number 
of elements in NjeyA;. Then g(J) = Sox.jcn f(K), so Mobius 
inversion gives 

FJ)= So (1)! lg(k). 


KICK 


In the case J = J), the above can be written 


Ix — Ail = S3(-0)*!| 1) jl 


ie. KCI jek 


which is a cryptic way of stating inclusion-exclusion. 

When the principle of Mobius inversion is applied to the lattice of 
(positive) divisors of an integer n, we recover the classical number- 
theoretic Mobius inversion. If f and g satisfy 


g(m) = >: f(k) for all m dividing n, 
k|m 


then Mobius inversion gives 


Geo Y= u(k,m)g(k) for all m dividing n, 
k|m 


and this, in view of Theorem 25.1(ii), is equivalent to Theorem 
10.4. 

We give a g-analogue of Example 10.2, where we used inclusion- 
exclusion to find the expression )7j"_)(—1)"~* (")k” for the number 
of surjections from an n-set to an m-set. 
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Theorem 25.2. The number of surjective linear transformations 
from an n-dimensional vector space to an m-dimensional vector 
space V over F, is 


ProoF: For a subspace U C V, let f(U) denote the number of 
linear transformations whose image is U. Let g(U) denote the 
number of linear transformations whose image is contained in U. 
Clearly, 


gU)= >> f(W) 


W:Wcu 
and g(U) is q”” if dim(U) =r. By Mobius inversion on the lattice 


of subspaces of V, 


fU)= SY) wWv)gren™, 
W:WCU 


Take U = V and use Theorem 25.1 (iii) to produce the stated result. 


Corollary. The number of n x m matrices over the field F, that 
have rank r is 


r 


Pa] Soo fg] em 


k=0 


We note that the number of injective linear transformations has 
a relatively simple form. If we fix a basis for an n-dimensional 
vector space and consider injections into an m-dimensional vector 
space, the image of the 7-th basis vector must be chosen as one of 
the (q™—q'') vectors not in the span of the images of the previous 
basis vectors. In summary, there are (q’” —1)(q"—q)---(q¢”—q"') 
injective linear transformations. Since Theorem 25.2 with m = n 
also gives an expression for this number, we have proved an identity. 
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Problem 25A. (i) Use Mébius inversion to derive an expression 
for the number of k-subspaces that meet trivially a given r-subspace 
of an n-dimensional vector space over F,. This should give a q- 
analogue to Eq. (10.5). (ii) For the special case when r + k = n, 
show, from another point of view, that there are exactly g’* such 
subspaces by consideration of r x n matrices over F, of the form 
(I M), where I is an identity of order r. 


Problem 25B. Use Mobius inversion to derive an expression for 
the number of nonsingular linear mappings of an n-dimensional 
vector space over F, to itself that fix no vectors other than 0, i.e. 
that do not have 1 as an eigenvalue. (These will be orthomorphisms 
of the additive group; see the Remarks following Theorem 22.9.) 


The partially ordered sets mentioned in the statement of Theo- 
rem 25.1 are all lattices. The following useful theorem was found 
by L. Weisner (1935). 


Theorem 25.3. Let uw be the Mobius function of a finite lattice L 
and let a € L witha >O;. Then 


S- u(Or, x) = 0. 
aaVa=1zr 
PROOF: Fix a and consider 


S:= D) nO,2)¢(2,y)C(a, =>) Dd H(0, 2) Hy, 1). 


xyeL rel yee, 
y>a 


Now on the one hand, 


c= 2, H(0,2) S> u(y); 


y>u 
yoa 


but y >a and y > x if and only if y > x Va, and the inner sum is 


I af ev @ =i, 
>) #1) = 


Src 0 ifxVa<l. 


340 A Course in Combinatorics 


Thus S is the sum in the statement of the theorem. On the other 


hand, 
S=S > uly,1) SY” p(0,2), 


y2a O<x<y 


and the inner sum is always 0 since y > 0. 


Corollary. For elements x,y of a geometric lattice L with x < y, 
u(a,y) has sign (—1)"™k)-r2k(@) and, in particular, is never 0. 


PROOF: We show that j(0;,1,) has sign (—1)""* by induction 
on the rank of L. Pick a point p € L. By semimodularity, aVp = 1, 
if and only if a = 1, or ais a copoint not on p; so Weisner’s theorem, 
Theorem 25.3, gives 


(25.6) w(0z,1z)=- >> p(0z,h). 


h:h<1z,,hZp 


Since all terms on the right-hand side have sign (—1)'*"(4)-! by the 
induction hypothesis, the proof is complete. 


The number of proper colorings (x) of a graph G with x colors 
can be found by Mobius inversion on the lattice L(G) introduced 
in Chapter 23, although this is not necessarily a very practical 
method. Recall that the elements of L(G) are the partitions A of 
the vertex set of G, all of whose blocks induce connected subgraphs 
of G. For A in L(G), let g(A) denote the number of mappings 
from the vertex set of G into a set of x colors (i.e. colorings) so 
that all vertices in each block of A receive the same color. Clearly, 
g(A) = a4. Let f(A) denote the number of mappings that are 
constant on each block of A, but such that the endpoints of edges 
joining distinct blocks of A receive different colors. Given a coloring 
‘counted by’ g(A), a moment’s thought shows that there is a unique 
coarser partition B so that the coloring is ‘counted by’ f(B) (one 
must coalesce two blocks of A of the same color if an edge joins 
them). Thus, g(A) = )¢,. 4 f(B), so Mobius inversion gives 


f(A) = >¢ uA, B)g(B). 


BrA 
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The number of proper colorings is f evaluated on O;,q) (the parti- 
tion into singletons), and this is 


Xoq(2) = S- u(Ox(@); B)x'5 a » S- ura); B) ge. 
B 


The polynomial x(x) is called the chromatic polynomial of the 
graph. The corollary above leads directly to the following state- 
ment. 


Theorem 25.4. The number of proper colorings of a graph G on n 
vertices in x colors is given by a monic polynomial y (x) of degree 
n whose coefficients alternate in sign. 


Problem 25C. Let G be a simple graph with n vertices and m 
edges. Show that the coefficient of x"~! in X,(x) is —m, and that 
the coefficient of 2”~? is m(m—1)/2 minus the number of triangles 


in G. 


T. Dowling and R. M. Wilson (1975) proved the following the- 
orem and its corollary, the latter being a generalization of the in- 
equality of Theorem 19.1 on linear spaces. 


Theorem 25.5. If L is a finite lattice so that u(x,1,) #0 for all 
x € L, then there exists a permutation 7 of the elements of L so 
that x V n(x) = 1, for all x € L. 


PROOF: Before we begin the proof, we remark that an example 
of a lattice that does not satisfy our hypothesis is a chain with 
more than two elements. And, of course, there is no permutation 
am with the property stated above. On the other hand, the lattice 
of all subsets of an n-set admits a unique permutation with the 
property above, namely the permutation that takes each subset to 
its complement. 

All matrices here will have rows and columns indexed by L. Let 


dae Vy SA 


0 otherwise. 


n(@,Y) = { 
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Let 6; be the diagonal matrix with 6)(2,x7) := u(az,1z). Then 
¢6,¢' = 7 because 


C516" (x, y) 


oe (a, a)oy (a, b)C(y, b) 


a,b 


= » d1(a, a) 


a:a>x and a>y 


S- ula, Iz) 


aazx“Vy 


and the last sum is, by (25.1), 1 if Vy = 12, and 0 otherwise. 

Under our hypothesis that u(x,1,) 4 0, the matrix 6; is nonsin- 
gular. Since ¢ is nonsingular too, we conclude that 7 is also. Hence 
some term in its determinant expansion does not vanish, and this 
implies the conclusion of the theorem. 


Corollary. In a finite geometric lattice of rank n, the number of 
elements of rank > n—k is at least the number of elements of rank 
<k,O<k<n. 


PROOF: Consider a permutation 7 as in Theorem 25.5 (which ap- 
plies because of the Corollary to Theorem 25.3). The semimodular 
law 


rank(x) + rank(m(x)) > rank(x V m(x)) + rank(x A m(x)) >n 
implies that the image of an element of rank < k is one of rank 
>n—k. 

We give a similar matrix proof of a theorem of T. Dowling (1977) 
on complementing permutations below. 


Theorem 25.6. If L is a finite lattice such that p(x,1,) #0 and 
u(Oz,x) #0 for all x € L, then there exists a permutation 7 of the 
elements of L so that 


eV ale)=H 1p. and ah rae) =0p 
for allz € L. 


PROOF: Let 6; be as in the proof of the previous theorem and let 
do be the diagonal matrix with 6o(x, 2) := u(0z,2). Now consider 
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K := €6,;C'd9C. Our hypotheses imply that « is nonsingular. We 
claim that «(x,y) = 0 unless x and y are complements. Then any 
permutation corresponding to a nonvanishing term in the determi- 
nant expansion of « will be a complementing permutation. 

To establish our claim, first note that & = nd9¢, so 


K(ey)= > (02,2). 


zzVa=lp,z<y 


If this sum is not zero, then there exists some z with z V x = I, 
and z < y, which implies that y Vz = 1,. By duality, 7’ = €'6o¢, 
where 

de Tee =O; 


0 otherwise. 


(x,y) = { 


Now note that « = ¢6,7' and, similarly, «(«, y) 4 0 will imply that 
xy = Or. 


Finally, we give the proof we promised at the beginning of the 
chapter. 


PROOF OF THEOREM 25.1: (i) Since the interval [A, B] is isomor- 
phic to the lattice of subsets of B\A, it suffices to show p(0,C) = 
(—1)/Cl. We use the equation (25.6) and proceed by induction on 
|C|. Let p be a point of C. There is only one copoint not on p, 
namely the complement of {p}. So (25.6) implies that 


w(O,C) = —p(0, C\{p}) = —(-1)fE-9 = (-1) 141. 


(ii) Again, it will suffice to calculate (1,m), and we use induc- 
tion on m. Let p be a prime divisor of m. Weisner’s theorem 
asserts 

(1, m) — S- (0, a). 


Icm(a,p)=m,a<m 


If p* divides m, then the sum on the right is empty, and thus 
u(1,m) = 0. If p? does not divide m, then there is one term, 
namely when a = m/p. 
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(iii) It will suffice to show that y(0,V) = (—1)kg) for V of 
dimension k. We proceed by induction on k. Let P be a 1- 
dimensional subspace of V. By Weisner’s theorem, 


U:UVP=V,UAV 


By the induction hypothesis, each term j(0,U) in the above sum 


is equal to (—1)F-1g*2"), The only subspaces U other than V such 
that U V P = V are those U of dimension k —1 that do not contain 


P; their number is 
Hew a 
q q 


This completes the proof of part (iii). 


(iv) Consider two partitions A and B with A = B. Say B has k 
blocks that, when numbered, are the unions of n1,no,...,2, blocks 
of A, respectively. Then the interval [A, 6] is isomorphic to the 
direct product of partition lattices 


Tc eis ess Tle: 


since a partition C with A < C ~ B is specified by a partition of the 
n; blocks of A that lie in the i-th block of B, for each 1 = 1,2,...,k. 

We now mention the fact that the Mobius function wp.g of the 
direct product P x Q of two partially ordered sets is the (Kronecker 
or tensor) product of the Mébius functions wp and pg of P and Q; 
that is, 


LpxQ((a1, 61), (a2, b2)) = wp(ai, a2) (b1, 62). 


(We leave the reader to think about this; just check that defining 
[ipxg as above produces an inverse of ¢pxg.) Thus p(A, 8) is the 
product of 1(On,,,, 1n,,) for ¢=1,2,...,k. 

We now show that (0q,,1n,) = (—1)""1(n — 1)! by induction 
on n. Let P bea point of II, i.e. a partition of an n-set into a 2-set 
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{x,y} and n — 2 singletons. The partitions A so that P VA = 1h, 
are the 2”~? partitions with two blocks that separate x and y; there 
are (eo) such partitions where the block containing x has size 1+ 1 
and the block containing y has size n — 1 —7%. From (25.6) and the 
induction hypothesis, 


“(0n,,, ln, ) = — 5 é Ff *) (—1)*(4)'(—1)""7"*(n — 2 — 2)! 
i=0 
= (-1)" (in - 1)! 


Problem 25D. The dual of Weisner’s theorem (derived by apply- 
ing Theorem 25.3 to the dual lattice) asserts 


S° w(e,1z)=0 for alla€ L witha <1). 


a:xA\a=0L 


Take a to be a partition with one block of size n— 1 and one of size 
1 (in the partition lattice) and use the above to give a somewhat 
different proof of (Oy, , 1n,,) = (-1)"1(n - 1)!. 


We use Mobius inversion over the lattice of partitions to get an 
expression for the number of connected labeled simple graphs on 
n vertices. Let X be an n-set. To each graph G with vertex set 
X, there corresponds the partition Cg of X whose blocks are the 
vertex sets of the connected components of G. We let g(B) denote 
the number of simple graphs G with V(G) = X and Cg = B. We 
let f(B) denote the number of simple graphs G with V(G) = X 
and where Cg is a refinement of 6. Clearly, 


AxB 


We are interested in g(1n,,), where 1q, is the partition with one 
block. But what is easy to calculate is f(B); we get a graph counted 
by f(8) by arbitrarily choosing a simple graph on each block of B, 
so if B has k; blocks of size 7, then 


3 
2 


f(B) = 2h2(2)ak(2) .. ok), 
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By Mobius inversion, 


9(B) = >> uA, B) f(A). 


AxB 


Specializing to B = 1y,,, we have 


(25.7) 9(1n,,.) = >> (A, In,,) f(A). 
A 


Recall that the number of partitions of X of type (ki, k2,..., kn) is 


n! 
(1!)F1ky!(2!)*2 kg! --- (mn! ink? 


cf. (13.3). And using Theorem 25.1(iv), we find the number of 
connected labeled graphs on n vertices to be 


_4)\kitket-t+ky-1 n! (ky fe tae Kn a 1)! ko (2)+k3(3)+--+k, (@ 
( 1) 2 (3) (3) (3) 
Z (Fig! --- (nl) Fake! 


where the summation is extended over all partitions (ky, k2,..., kn) 
of the integer n, that is all nonnegative n-tuples with 1k, + 2k) + 
---+nk, =n. 

We do not mean to imply that this is a particularly nice answer 
to our original question. Indeed, it is a summation over a large 
collection of objects, whose number exceeds eV” for large n. But it 
is a natural illustration of Mobius inversion and does greatly reduce 
the problem. For n = 5, we have 


partition of 5 number of A f(A) u(A, ln) 
5 iL 1024 1 
Al i) 64 sal 
32 10 16 —1 
dll 10 8 2 
221 15 4 2 
2111 10 2 —6 
11111 i i 24 
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from which table and (25.7) we see that 728 out of the 1024 simple 
labeled graphs on 5 vertices are connected. 


We give an application of Mobius inversion to coding theory. 
Recall the definition of MDS codes given in Example 20.3. Let C 
be such a code, i-e.an |[n,k,d] code over F, with d=n—k+1. 
If we consider a set of d positions and then look at the subcode 
of C' with zeros in all other positions, this subcode has dimension 
>k—(n-—d) =1. Since this subcode has minimum distance d, it 
must have dimension exactly 1 by Theorem 20.2. It follows that for 
n > d' > d, specifying a set of d’ positions and requiring codewords 
to be zero in all other positions, will define a subcode of C’ with 
dimension d'—d+1. We use this in the proof below. We shall show 
that the weight enumerator of an MDS code is determined by its 
parameters. 


Theorem 25.7. Let C be an |n,k| code over F, with distance 
d=n—k+1. Then the weight enumerator of C is 1+ 37, Aiz’, 
where 


Proor: Let R be a subset of N := {0,1,...,n}. Define f(R) to 
be the number of codewords (co, ¢1,.--,¢€n—1) for which {7 : ¢; 4 
O} = R. For a subset S of N, we define g(S) := dincg f(R). As 


remarked above, we have 


w= ft if |S| <d—1, 
ae gtk. aha) S| Sa: 


The definition of f implies that A; = ren rj=i F(R). We now 
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apply Mobius inversion with Theorem 25.1 (i). We find 


A = >) do aS,R)a(5) 


RCN, |R|=i SCR 


{EQ EQerwe 


j=o 


OEQere 


j=d 


The result follows if we replace 7 by i — 7 and then use (;) = 
i-1 aes 
Gai) + (5). 

Theorem 25.7 implies rather severe restrictions on the alphabet 
size of an MDS code. 


Corollary. If there exists an MDS code C over F, with length n 
and dimension k, then 


(i) gq>n-—k+lork<1, 
(ii) gq>k+lord=n-—k+1<2. 


Proor: (i) Let d= n—k+1. From Theorem 25.7 we find for 
d <n that 


0< Aun = (4%, )@-Da-a. 


(ii) Let G := (I, P) be the generator matrix of C. Since C’ has 
minimum weight d, every set of d— 1 = n-—k columns of the parity 
check matrix H := (—P' I,,_,) is linearly independent. Hence every 


square submatrix of H is nonsingular. So, no codeword of C+ has 
n —k zeros, i.e. C+ is also MDS. Apply the result of (i) to C+. 


Problem 25E. Let P be a partially ordered set. Remember that 
if x < y, then a sequence © = 4% < 4% <--- < ap = y is called a 
chain of length k from x to y. Let c,(x,y) denote the number of 
such chains (so c;(a, y) = 1). Prove that 


p(x, y) = >> (-1)*ex (a, y). 


k>1 
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Problem 25F. We consider the lattice of subspaces of the vector 
space V := Fj. Let S be another vector space over Fj. Define for 
a subspace U: 


f(U) := the number of linear maps from V to S with kernel U, 


g(U) := the number of linear maps from V to U with kernel > U. 
(i) Determine g(U) and then apply Mobius inversion to 


gU)= >> FW). 


W:W2DU 


(ii) Show that 


FLOW =d0 i (-1)*q2)| sy". 


k=0 


(iii) Prove the polynomial identity 


Tle -0) = 0 ae 


k=0 k=0 


Notes. 
The expression of the chromatic polynomial of a graph in terms 
of the Mobius function of L(G) is due to G.-C. Rota (1964). 
Theorem 25.1 (v) is essentially Euler’s formula for polytopes: 


fo= Athol he = 9, 


where f; is the number of faces of rank i, or dimension 7 — 1 (fp = 
fn = 1). See B. Griinbaum (1967). Also see R. Stanley (1986), 
where posets with the property that p(x, y) = (—1)¢ whenever the 
ranks of x and y differ by d are called Eulerian posets. 
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26 


Combinatorial designs and 
projective geometries 


Geometries over finite fields are a rich source of combinatorial de- 
signs and related combinatorial configurations. We begin with two 
topics (arcs and subplanes) that we choose to mention only in pro- 
jective planes before going on to discuss quadrics and other config- 
urations in general projective spaces. 

An (m,k)-arc in a projective plane is a set of m points, no k+1 of 
which are collinear. We were concerned with (m, 2)-arcs in Problem 
191. 

Let A be an (m,k)-arc in a projective plane of order n and let x 
be a point in A. The n+ 1 lines on zx each contain at most k — 1 
other points of A, so 


m<1+(n+1)(k-1). 


An (m,k)-arce A is called perfect when equality holds above. Any 
line that contains a point of a perfect (m, k)-arc evidently contains 
exactly k points of the arc; that is, 


|EN A|=Oork 


for any line L. Clearly, the nonempty intersections of lines with a 
perfect (m, k)-arc provide the blocks of a Steiner system $(2,k,m). 
A single point is a perfect (1,1)-arc. The set of n? points not on 
a fixed line of a projective plane of order n is a perfect (n?,n)-arce 
and the corresponding Steiner system is an affine plane of order n. 
The hyperovals in Problem 19I are perfect (q+ 2, 2)-arcs. The cor- 
responding designs are trivial. But these have ‘dual’ arcs where the 
corresponding Steiner systems are interesting—see Problem 26A. 
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The construction of perfect (m,k)-arcs in Desarguesian planes 
of even order given in part (2) of the following theorem is due to 
R. H. F. Denniston (1969). No examples of perfect (m, k)-arcs, 
1<k <n, in projective planes of odd orders n are known at this 
time. 


Theorem 26.1. 


(1) If there exists a perfect (m,k)-arc in a projective plane of 
order n, then k divides n. 

(2) If q is a power of 2 and k divides q, then there exists a 
perfect (m, k)-arc in PG(q). 


PRroor: Let x be a point not in a perfect (m,k)-arc A in a pro- 
jective plane of order n. The lines on «x partition the remaining 
points, but each line on x contains exactly 0 or k points of A. So 
k must divide m = 1+ (n+ 1)(k—1). It follows that k divides n. 

Now let gq be a power of 2 and & a divisor of g. Let f(x,y) = 
ax? + Bry + yy* be any irreducible quadratic over F,, and let H 
be any subgroup of the additive group of F, with order k. In the 
affine plane AG(2, q), let 


A:={(a,y): f(x,y) € AH}. 


We claim that any (affine) line meets A in 0 or k points. When the 
affine plane is embedded in a PG2(q), A will be a perfect (m, k)-arc. 
Consider a line L = {(z,y) : y = mx +b} with 6 and m both 
nonzero. (We leave consideration of lines of the forms {(z,y) : y = 
mx} and {(x,y) : x =c} to the reader.) 
The intersection LM A is the set of points (2, mx + b), where 


ax’ + Ba(ma +b) +7(ma +b)? € H, or 
F(x) € H, where F(x) := (a+ 6m+- ym?)a2? + Boa + 707. 
We are working over a field of characteristic 2, so 
ze (a+ bm+ym*)x? + Box 


is a linear mapping; it has kernel of order 2 since the irreducibility 
of f(x,y) ensures that both @ and the coefficient of x? are nonzero. 
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The image Kr of this mapping is thus a subgroup of order q/2 of 
the additive group of F,. The image of F is a coset of Kp, but since 
F(x) is never 0 (again by the irreducibility of f(x, y)), the image 
of F is the complement F, \ Ar. In summary, 


2 ifa ¢ Kp, 


Ne { 0 ifae Kp. 


Thus |LNA| = 2|H (EF, \Kr)|. The subgroup # is either contained 
in Kp, in which case LM A = @, or intersects Kp in a subgroup of 
order k;/2, in which case |L A| = k. 


As a corollary of this theorem (with n = 2*!, k = 2”), we 
obtain Steiner systems 


S(2, one. g2mt+1 -_ om), 


Steiner systems S(2,k, v) are in many ways most interesting when v 
is close to k?, as is the case for affine and projective planes (Fisher’s 
inequality, Theorem 19.6, shows v > k? —k +1). These examples 
have v < 2k, which is still impressive. 


Problem 26A. Let A bea perfect (m, k)-arc in a projective plane 
P of order n, 1 < k <n. Let A®* be the set of lines that do not 
meet A. Show that A* is a perfect (m*, |n/k])-arc in the dual plane 
P*, and calculate m* in terms of m, k,n. 


Problem 26B. A parallel class in a Steiner system S$(2,k,m) is 
a set A of m/k blocks so that each point is incident with exactly 
one block in A. An S(2,k,m) is resolvable when the blocks can be 
partitioned into parallel classes (cf. Problem 19K). 

Let A be a perfect (m,k)-arc in a projective plane P of order 
n. Explain why the Steiner system $(2,,m) whose blocks are the 
nontrivial intersections of lines with A is resolvable. 


A subplane S of a projective plane P is a substructure of P that 
is a projective plane in its own right. Recall that a substructure of 
an incidence structure (P,, 6,1) is an incidence structure (Po, Bo, Io) 
where Py C P, By © B, and Ip = IN (Po x Bo). Note that given 
an automorphism (or collineation) a of a projective plane P, the 
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substructure S consisting of the points of P fixed by a and the lines 
of P fixed by a has the property that two points of S are incident 
with a unique line of S, and two lines of S are incident with a unique 
point of S. If S contains four points, no three collinear, then S is 
a subplane; but it may be a nearpencil or have one or no lines. 


Example 26.1. Let V := F?.. The points and lines of PG(q") are 
the 1-dimensional and 2-dimensional Fj» -subspaces of V. Define a 
substructure S of PG'2(q") whose points and lines are those 1- and 
2-dimensional Fj» -subspaces of V that admit bases consisting of 
vectors whose entries are in the subfield F,. Then S is a subplane. 
A line of S is incident with g” +1 points in PG2(q"), but only q+ 1 
of those are points of S. 


Theorem 26.2. If a projective plane P of order n contains a sub- 
plane S of order m <n, then either 
(i) n = m?, or 


(ii) n> m?+m. 


ProoF: Let L be a line of the subplane S and x a point on L in 
P but not in S. The n other lines on x can contain at most one 
point of S (since a line M in P containing at least two points of 
S would necessarily belong to the subplane, and then the point x 
in common to M and L would also belong to the subplane). The 
line L contains m+ 1 points of S, which has a total of m?+m+1 
points, each of which is on some line through x, so m? < n. 
Equality will imply that every line of P will meet S (evidently 
in one or m+ 1 points) since if there were a line N disjoint from 
S, and x were taken as the point of intersection of DL and N, each 
of the m? points of S not on L would belong to one of n — 1 lines. 
Now assume m? <n, so that there does exist a line N incident 
with no points of S. Each of the m?+ m+ 1 lines of S contains 
one point of N and no two such lines can contain the same point, 
som?+m+1<n4+1. 


Problem 26C. Show that if PG2(F) contains the Fano configu- 
ration PG (2), then F has characteristic 2. Suggestion: Without 
loss of generality, four points of the Fano configuration are (1, 0,0), 
(0,1,0), (0,0,1), and (1,1,1). Calculate the homogeneous coordi- 
nates of the other three points (that must lie on a line). 
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Problem 26D. A blocking set in a projective plane is a set S of 
points that contains no line but such that every line meets S' in at 
least one point. Show that a blocking set in a projective plane of 
order n contains at least n+./n+1 points, and that equality holds 
if and only if S is the set of points of a Baer subplane. 


We illustrate a ‘polynomial method’ with the proof of the follow- 
ing theorem on a type of affine blocking sets, due to Brouwer and 
Schrijver (1978). 


Theorem 26.3. If V is a subset of AG(k,q) that meets all the 
hyperplanes, then 

|V| > k(q-1) +1. 
ProoF: Let A be the blocking set. Without loss of generality 
0¢ A. Let B:= A\{0}. Then B intersects all hyperplanes that do 
not contain 0. These are defined by equations w,x71+---+wpr, = 1, 
where w runs through all non-zero vectors. Therefore 


PG yoy .4 45,24) = [[@m + botg +--+ + bet, — 1) 
beB 
is identically 0 on the space except in 0. 

It is easily shown by induction that a polynomial that is identi- 
cally 0 on the space must be in the ideal generated by the polyno- 
mials x! —2;,i=1,...,k. Write F(x) as 

k 

PB ge Be) = Seite iy) a G(Bigesechp) 

i=1 
where the highest degree of x; in G is at most q—1. For each 3, 
the polynomial 27; F'x,...,x,) is identically 0, so this is also true 
for 2;G(x1,...,2%). Therefore G is divisible by T(x? — 1). Since 
F(0) £0, also G(O) 4 0 and therefore G has degree k(q — 1). So, 
the degree of F’', which is |B|, must be at least k(q — 1). 

A quadratic form in indeterminates 11, %2,...,2, over a field F 
is a homogeneous polynomial of degree 2 in those indeterminates, 
1.e. 


(26.1) f(x) = f(a, DQ, +265 re = S- CijXitj 


tj=l 
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where the coefficients cj; are in F. The same quadratic form is 
defined by more than one matrix C = (c;;) of coefficients, since it 
is only the diagonal entries and the sums ¢;; + cj; that are impor- 
tant. We could require that c;; = 0 for 7 > 7 and then quadratic 
forms in 71, %,...,%, are in one-to-one correspondence with upper 


triangular n x n matrices C’ over F. 
A quadric in PG,(F) is a set of projective points 


Q = Q(f) = {(x) : F(x) = 0} 


where f is a quadratic form in n + 1 indeterminates. We must 
choose a basis and identify the vectors with (n + 1)-tuples over F 
in order to make this definition. 

Two quadratic forms f and g are projectively equivalent when 
g can be obtained from f by an ‘invertible linear substitution’, 
or in matrix notation, g(x) = f(xA) for some nonsingular n x n 
matrix A over F. So if f is given by a matrix C as in (26.1), i.e. 
f(x) =xCx', then g is given by the matrix ACA'. For example, 
for any positive integer n, nv? + nx} + nx} + nx is projectively 
equivalent to x7 + v3 + 23 + xj over the rationals; see equation 
(19.12). 

We remark that the matrix notation does not lend itself well to 
changes of the names or number of the indeterminates, which we 
wish to allow. For example, replacing x; in a quadratic form f 
by yi + yo, where y, and yg are new indeterminates, produces a 
projectively equivalent form. 

The rank of a quadratic form is the least number of indeter- 
minates that occur (with nonzero coefficients) in any projectively 
equivalent quadratic form. For example, (2; + ---+2,)? has rank 
1. Projectively equivalent quadratic forms have the same rank. 
A quadratic form in r indeterminates is said to be nondegenerate 
when it has rank r. 


Example 26.2. Consider quadratic forms in two indeterminates 
f(a, y) = ax* + bry + cy’. 


The form has rank 0 if and only if a = b = c = 0. The corresponding 
quadric contains all points of PG\(F) (the projective line). If not 
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zero, it will have rank 1 if and only if it is a scalar multiple of the 
square of a linear form dx + ey, which is the case if and only if the 
discriminant b? — 4dac is 0. The corresponding quadric in PG(F) 
will then consist of a single point. A rank 2 quadratic form in 
two indeterminates is either irreducible or can be factored into two 
distinct linear forms. In the first case, the corresponding quadric 
in PG,(F) will be empty, and in the second case, the corresponding 
quadric in PG,(F) will consist of two points. 


It is clear that an irreducible quadratic form of rank 2 is not pro- 
jectively equivalent to a reducible quadratic form, since the latter 
has zeros in F and the former does not. A reducible quadratic form 
is projectively equivalent to 71%2. 


Problem 26E. Show that over a field F of odd characteristic, a 
quadratic form f as in (26.1) is degenerate if and only the sym- 
metric matrix C + C' is singular. For a field F of characteristic 
2, show that f is degenerate if and only if C + C' is singular and 
x(C + C') =0 for some x with f(x) = 0. 


It is important to understand that the intersection of a quadric 
with a flat U of PG,,(F) is a quadric in that flat. For example, 
suppose that U is a projective line PG'(F) and that f is a quadratic 
form in %,%1,..-,2%n. The homogeneous coordinates of points of 
U are (ya+ zb) where a = (ao, @1,...,@n) and b = (bo, b1,..., bn) 
are linearly independent vectors. These points are in one-to-one 
correspondence with the homogeneous coordinates ((y, z)) of points 
of PGi(F). Say f = Mocicjen Cijvivj. Then 


9(y.2):= S— cij(yai + 2b;)(yaj + 2b;) 


1<i<j<n 


is a quadratic form in y and z and defines a quadric in PG;(F). 
The quadratic form g may be degenerate even if f is not. In view 
of Example 26.2, we see that a line is either contained fully in a 
quadric Q, or meets Q in 0, 1, or 2 points. 


Lemma 26.4. Any quadratic form f of rank n > 3 is projectively 
equivalent to 


(26.2) £1L_ + g(X3,---, Ln) 
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for some quadratic form g(x3,...,%n).- 


PROOF: First assume q is odd. It is easy to see that a quadratic 
form f is projectively equivalent to some )/)<;<;<, Cijtvitj where 
cy #0. Let y := 21 + Fay (C122 +++++¢in%n). Then f = eny? + 
g(X2,---,Xn). Inductively, if the rank of f is at least 3, we find that 
f is projectively equivalent to 


h(x) = ax} + bar} + cx} + g'(a4,.--,2n) 


with a,b,c all nonzero. The three scalars can be changed by any 
nonzero square factor and a projectively equivalent quadratic form 
results; some pair must differ by a square factor, so we can assume 
that b = c, say. 

We claim that there exist s,t € FE, so that s? +t? = —b-la. 
One way to see this is to consider the addition table of F,, which 
is a Latin square. The number of squares in F,, including 0, is 
(¢ + 1)/2. The element —b-‘a (or any other) occurs (q — 1)/2 
times in the (q — 1)/2 columns of the Latin square indexed by the 
nonsquares, and (q — 1)/2 times in the (q — 1)/2 rows indexed by 
the nonsquares; so —b~'a must occur at least once in the submatrix 
with rows and columns indexed by the squares. With s and t so 
chosen, h is projectively equivalent to 


ax? + b(sxq + tx3)* + b(ta — x3)? + g/(a4,...,2n) 


2 2 2 / 
= ax} — ax — ar3+9'(24,.--,2n) 


= (ax + ax2)(x1 — £2) — azz Pig (Tis. an 


and the latter is clearly projectively equivalent to (26.2). 
The case q even is a little more tedious; see the proof of Theorem 
5.1.7 in J. W. P. Hirschfeld (1979). 


Theorem 26.5. 
(i) Any quadratic form f of odd rank n is projectively equivalent 
to 


(26.3) fo(x) = @y%q +--+ + Lp —2En_1 + ca? 


for some scalar c. 
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(ii) Any quadratic form f of even rank n is projectively equivalent 
to either 


(26.4) fi (x) I= 2XQ ++ + Ly-3Xp—2 + Ln-1Ly 
or 
(26.5) fo(x) = 11 Bq +++ + Ly-3n-2 + 3 ae Di) 


where p(Xn_-1,%n) is an irreducible quadratic form in two indeter- 
minates. 


ProoF: This follows from Lemma 26.3 and induction. 


Quadratic forms (and the corresponding quadrics) of odd rank 
are called parabolic. Quadratic forms of even rank projectively 
equivalent to (26.4) are called hyperbolic, and those equivalent to 
(26.5) are called elliptic. Any two hyperbolic quadratic forms of a 
given rank are projectively equivalent to (26.4) and hence to each 
other. It is also true that all parabolic quadratic forms of a given 
even rank are projectively equivalent, i.e. we may take c = 1 in 
(26.3), and that all elliptic quadratic forms of a given rank are pro- 
jectively equivalent. See J. W. P. Hirschfeld (1979) for this and 
more on canonical forms. That hyperbolic and elliptic quadratic 
forms are not projectively equivalent is a consequence of the fol- 
lowing theorem. 


Theorem 26.6. A nondegenerate quadric Q in PG,,(q) has cardi- 
nality 


cS if n is even, i.e. Q is parabolic, 
(m+1)/2_4)(g(n-1)/2 Hien . : 

(g a4 +) if n is odd and Q is hyperbolic, 
(n+1)/2 (m=1)/2_ dicen ; ae 

(g <M Dif n is odd and Q is elliptic. 


PrRooF: In general, if 
Pisa gk) = 102 gl hgy04 54 hy) 


and there are N vectors (#3,...,x,) such that g(x3,...,2,-) = 0, 
then there are (2g — 1)N + (q — 1)(q’-? — N) vectors (a1,...,2r) 
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such that f(x1,...,2,-) = 0. This allows us to verify by induction 
the following formulae for the number of zeros in F; of a rank r 
quadratic form f in r indeterminates as the following: 


ames if r is odd, i.e. f is parabolic, 


gq t+q/?—q'/?"! ifr is even and f is hyperbolic, 
q i-q?+q "ifr is even and f is elliptic. 


Of course, the number of projective points in PG,,(q) on the cor- 
responding quadrics is found from the above numbers by replacing 
r by n+ 1, subtracting 1 (the zero vector) and then dividing by 
q-—1. 


Theorem 26.7. Let Q be a nondegenerate quadric in PG,,(q). 
The maximum projective dimension of a flat F with F C Q is 


n/2—1 if n is even, i.e. Q is parabolic, 
(n—1)/2 ifn is odd and Q is hyperbolic, 
(n —3)/2 ifn is odd and Q is elliptic. 


PROOF: Let f be a nondegenerate quadratic form in r indetermi- 
nates. The statement of the theorem is equivalent to the statement 
that the maximum dimension of a subspace U of F7/ such that f 
vanishes on U is 


(r—1)/2 if r is odd, i.e. f is parabolic, 
Fi2 if r is even and f is hyperbolic, 


r/2—1 if r is even and f is elliptic. 


First note that if f = fo in (26.3), then f(x) = 0 for any 
x € span(€g,e4,...,€,-1), where e), e2,...,e, is the standard ba- 
sis for F;. If f = fi in (26.4), then f(x) = O for any x € 
span(e2,e4,...,e,). If f = fo in (26.5), then f(x) = 0 for any x € 
span(e2,e4,...,€;-—2). These subspaces have dimensions (r — 1)/2, 
r/2, and r/2 — 1, respectively. 

It remains to show that f cannot vanish on any subspace of 
larger dimension in any of these cases. We will use Theorem 26.5 
and induction. The cases r = 1 and r = 2 are trivial. Suppose 
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Pig Ge) = Bie + 9 (3, >. @,) and that f(x). 0: for all x 
in some subspace U C FE? of dimension k. To complete the proof, 


of dimension 


we will show that there exists a subspace U’ C 
>k-—1so that g(y) = 0 for ally € U’. 
Clearly, g(y) = 0 for all y in 


gr—2 
q 


Uo := {(S33o Ste) : (0,0, @3,...;%r) € eae 


We are done if dim(Up) > k — 1, so assume dim(Up) = k — 2. Then 
there exist vectors 


(Le Oya9, eee@,) and: (0) 1363.20 056.) 


in U. Then (1,1,a3+53,...,a@-+6,) € U so g(a3+b3,...,@,+,) = 
—1. The form g evidently vanishes on all vectors in 


span((a3,...,@;)) +Up and span((bs,...,b;)) + Up. 
One of these subspaces must have dimension > k —2 because other- 


wise (a3 +3,...,a,+,) € Uo, which contradicts g(a3+63,...,a-+ 
b,) = —1. 


Example 26.3. Let f be a quadratic form in three indeterminates 
and consider the quadric Q(f) in the projective plane PG2(q). If f 
is nondegenerate, there are g+ 1 projective points on Q(f). As we 
remarked above, any line L of PG2(q) meets Q(f) in a quadric in 
L, which in this case consists of 0, 1, or 2 points since Q(f) cannot 
contain L by Theorem 26.7. So Q(f) is a set of g +1 points, no 
three of which are collinear, i.e. an oval. See Problem 191. 

If f has rank 0, Q(f) is all of PGo(q). If f has rank 1, Q(f) 
consists of the points of a line in PG»(q). If f has rank 2, there 
are two cases. If f is reducible, say f is projectively equivalent to 
xy, then Q(f) consists of the points on the union of two lines of 
PG2(q); if f is irreducible, then Q(f) is a single point in PG(q). 


Example 26.4. The first order Reed-Muller codes were intro- 
duced in Chapter 18. Here is one way to introduce the entire 
family: Let V = Fj”. We consider vectors of length 2” whose coor- 
dinates are indexed by elements of V; for concreteness, write V = 
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{vo, Vi,--+;Vam_1}. The k-th order Reed-Muller code RM(k,m) is 
defined to consist of all vectors (of length 2”) 


(f(vo), f(v1),-- +> f(v2m—1)) 


where f ranges over all polynomials in 71,..., 7 of degree at most 
k. 

Since a linear form x;, +--+ 2%;, gives the same function as the 
quadratic form a7, +-+-+2?. over Fy, the words in the second order 
code RM(2,m) are given by binary quadratic forms f(x1,...,2n) 
and their ‘complements’ f(x1,...,2n) +1. Theorems 26.4 and 26.5 
are useful for determining the weights that occur for codewords in 
RM(2,m). 

We must consider degenerate forms also; e.g. 7122 + x3%4 corre- 
sponds to a codeword of weight 24 in RM(2,6). It will be found 
that the weights of codewords in RM (2,6) are 0, 16, 24, 28, 32, 36, 
40, 48, and 64. 


Problem 26F. Let f(x) := x17 +-+++22m-1%2m. Then f deter- 
mines a coset C of RM(1,2m) in RM(2,2m) consisting of words 
corresponding to f(x) + a(x), where a(x) ranges over the 27"! 
affine functions (linear plus possibly a constant term) in 2m vari- 
ables. Show that in C half of the words have weight 27”-!+ 27-1! 
and half have weight 27-1! — 27-1. 


Example 26.5. Let Q3 be a nondegenerate elliptic quadric in pro- 
jective 3-space PG'3(q). By Theorem 26.6, |Q3| = q?+1. By Theo- 
rem 26.7, Q3 contains no lines. Any plane P meets Q3 in a quadric 
in that plane; in view of Example 26.3, every plane must meet Q3 
in either an oval in that plane or a single point. Any three points of 
Qs3 are contained in a unique plane, so it follows that the nontriv- 
ial intersections of planes with @3 provide the blocks of a Steiner 
system 
S(3,¢41,¢7°+1). 

In general, a set of gq? +1 points, no three collinear, in PG'3(q) is 
called an ovoid and an $(3,n + 1,n? +1) is called a Mobius plane 
or an inversive plane. 


Example 26.6. Let Q, be a nondegenerate quadric in projective 
4-space PG,(q). By Theorem 26.6, |Q4) = @+q@+4+q+1. Let 
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O be the incidence structure whose points are the elements of Q4 
and whose blocks are the lines of PG4(q) fully contained in Qu. 
Each point of Q is on exactly q+ 1 blocks of Q (see Problem 26F). 
Given a point x and a block L of Q, the intersection of the plane 
P:= {x}VL with Q, is a quadric Q2 in P that contains, obviously, 
a line and a point off that line. From Example 26.3, Q2 must consist 
of the points on two (intersecting) lines. This implies that Q is a 
partial geometry pg(r,k,t), as defined in Chapter 21, where 


r=q+1,k=q+1,t=1. 


Example 26.7. Let @; be a nondegenerate elliptic quadric in 
PGs(q). By Theorem 26.6, |Qs| = (¢ + 1)(q?+ 1). By Theo- 
rem 26.7, Qs contains no planes. Let Q be the incidence structure 
whose points are the elements of Qs; and whose blocks are the lines 
of PG;(q) fully contained in Q;. By Problem 26F, each point of Q 
is on q? +1 lines of Q. By an argument similar to that of Example 
26.6, Q is a partial geometry pg(r,k,t) where 


r=q4+1, k=q4+1, t=1. 


Partial geometries with t = 1 are called generalized quadrangles. 
See L. M. Batten (1986) for further results and references. 


Problem 26G. Let f be a nondegenerate quadratic form in n 
indeterminates given by a matrix C' = (c;) over F, as in (26.1). 
Let Q = Q(f) be the corresponding quadric in PG',_;(K,). Let 
p = (x) be one of the points on Q. Let 


Tp := {(y) :x(C+C")y' = 0}. 


Then T,, is a hyperplane in PG,_1(F,) by Problem 26E. Show that 
T,Q consists exactly of p and the union of any lines on p that are 
contained in @. Further show that if W is any hyperplane of T;, 
not containing p, then Q’ := W MQ is a nondegenerate quadric in 
W (= PG,-3(F,)) and that Q’ is parabolic, hyperbolic, or elliptic 
according to whether @ is parabolic, hyperbolic, or elliptic. We 
see, in particular, that the number of lines on p that lie entirely in 


Q is equal to |Q’|. 
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A hermitian form in indeterminates 1,...,2, over Fj2 is an ex- 
pression of the form 


(26.6) Be) Sigs ha = S- Cig Lis, 
igj=l 
where the coefficients c;; are from F,2 and where cj; = Gs In 


particular, the diagonal coefficients c;;, being fixed by the Frobenius 
automorphism x +> x, lie in F,. 

Two hermitian forms f and g over F,2 are projectively equivalent 
when g can be obtained from f by an invertible linear substitution, 
or, in matrix notation, g(x) = f(xA) for some nonsingular n x n 
matrix A over F,2. So if f is defined by a matrix C as in (26.6), 
i.e. f(x) =xCx!, then g is defined by the matrix AC .A*, where A* 
is the conjugate transpose of A; if A = (a;;), then A* := (a‘,). 

The rank of a hermitian form is the least number of indeter- 
minates that occur (with nonzero coefficients) in any projectively 
equivalent hermitian form. 


A hermitian variety in PG,,(q) is a set of projective points 
H = A(f) := {(x): f(x) = 0} 


where f is a hermitian form in n + 1 indeterminates. We must 
choose a basis and identify the vectors with (n + 1)-tuples over F,2 
in order to make this definition. It can be seen that the intersection 
of a hermitian variety in PG,,(q”) with a flat is a hermitian variety 
in that flat. 


Theorem 26.8. A hermitian form of rank n is projectively equiv- 
alent to 


(26.7) Ge gs eet git 


PRooF: It is not hard to see that any nonzero hermitian form is 
projectively equivalent to h as in (26.6) where c;,; 4 0; we leave 
this to the reader. Let y := cy%1 + cyt +--+ + Cin%n. Then 


h = cl yy! +g9(x2,...,;2n) where g is a hermitian form in x2,..., 2p. 
Since ci; € F,, there exists a € F,2 such that a?! = cj} and then 
h = 291+ g(x9,..., 2), where z = ay. 


The theorem follows from this step and induction. 
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Theorem 26.9. The number of points on a nondegenerate hermi- 
tian variety H in PG,(q’) is 


(Ge @ apo) 
va 


PROOF: For each nonzero a € F,, there are q + 1 values of x € Fj2 
such that 27*! = a. If 


f(v1,...,2n) = of? + g(axo,...,2n) 


and there are N vectors (%2,...,%n) such that g(x2,...,%n) = 0, 
then there are N+(q41)(q-) —N) vectors (#1,...,2n) such that 


f(x1,...,2%n) = 0. Inductively, there are q?"-! + (—1)"(q" — q”"') 
vectors in F’} that are zeros of (26.7) and the theorem follows. 


Example 26.8. Consider hermitian varieties in the projective line 
PG,(q’). If a hermitian form f in two indeterminates has rank 0, 
then the variety H(f) contains all q? +1 points; if f has rank 1, 
H(f) consists of one point; if f has rank 2, H(f) contains q + 1 
points. It follows that if H is a hermitian variety in PG,,(q’) for 
any n, then lines of PG,,(q*) meet H in 1, q+ 1, or q? +1 points. 

Now consider a nondegenerate hermitian variety H2 in the pro- 
jective plane PG2(q’). It has q?+ 1 points by Theorem 26.9. Any 
line L meets H in a hermitian variety LM H2 in that line. As an 
exercise, the reader should check that no lines are contained in Ho, 
so |LM H| = 1 or q+1. It follows that the nontrivial intersections 
of lines of PG(q?) with H» provide the blocks of a Steiner system 


S(2,¢+1,¢° +1) 


on the point set A». 

Designs with these parameters are called unitals. Further analy- 
sis shows that the designs constructed above are resolvable; see R. 
C. Bose (1959). 


We conclude this chapter with a construction of higher dimen- 
sional analogues of the Steiner systems $(3,q+1, q?+1) of Example 
26.5; those constructed below have been called circle geometries. 


366 A Course in Combinatorics 


Theorem 26.10. Ifgq is a power of a prime and n a positive integer, 
then there exists a Steiner system S(3,q+1,q" +1). 


PROOF: Let V be a 2-dimensional vector space over Fj» and let ¥ 
be the set of 1-dimensional subspaces of V over Fj (i.e. the q” + 1 
points of PG,(q"), the projective line of order q”). 

Now think of V as a 2n-dimensional vector space over F,. The 
blocks of our Steiner system will come from the 2-dimensional sub- 
spaces U over F, of V that are not contained in any member of ¥. 
For every such subspace U, let 


Byte Ae POS 10; 


Note that each P € By meets U in a 1-dimensional subspace over 
F, (that contains g — 1 of the q? — 1 nonzero vectors of U), so 
By| = q+ 1. If W = XU for some nonzero scalar A € Fyn, then 
Bu = By; we take only the distinct sets By as blocks. 

Consider three distinct points P; € ¥,7i = 1,2,3. Say P; is the 
Fjn-span of a vector x;, 7 = 1,2,3. These three vectors are linearly 
dependent over Fj, say 


X3 = WX; + BXe 


with a, 3 € Kj. But then it is clear that U := spang, {@X1, 3X2} 
meets P,, P2, and P3 all nontrivially. Suppose some 2-dimensional 
subspace W over F, meets each of the P; nontrivially, say W con- 
tains y;x; with 0 #4 y € Kj», 1 = 1,2,3. Then these vectors are 
linearly dependent over F,, say 


3X3 = ay1X1 + byoxe2 


where a,b € F,. Since x; and x2 are linearly independent over Fj», 
we have 73a = ay, and 738 = bye. It follows that yzU = W, and 
we see that three distinct points are contained in a unique block. 


Problem 26H. Let a be a collineation of a projective plane P. 
(i) Show that if a fixes all points on a line / and two points not 
on 1, then a fixes P pointwise. 
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(ii) If @ fixes all points on a line J, then there is a point P such 
that a fixes P and every line through P. 
(Such an automorphism of P is called a central collineation.) 


Problem 261. In Chapter 23 we saw that AG(3, ¢q) is a subgeom- 
etry of PG(3,q), where the missing part is a projective plane P, 
sometimes called the plane at infinity for AG(3,q). Let q = 2” 
and let O be a hyperoval in P. We define an incidence structure 
I with as points the points of AG(3,q) and as lines those lines of 
AG(3,q) that meet P in a point of O. Show that I is a generalized 
quadrangle. 


Problem 26J. Consider a hyperoval O in PG(2,4). Define a 
graph G with as vertices the points of the plane not on O and 
edges (x,y) if the line through x and y meets O. Prove: 

(i) Every edge is in a unique triangle. 

(ii) For every triangle (x,y,z) all other vertices are joined to 
exactly one of x, y, z. 

(iii) G is a generalized quadrangle. 


Problem 26K. Consider an arbitrary symmetric (v, k, \)-design. 
Define an arc to be a set S of points, no three on a block. A block 
is called a tangent of S if it meets S in one point. Find an upper 
bound on |.S| depending on whether S$ has a tangent or not. 


Problem 26L. An oval in asymmetric design is an arc that meets 
the bound in Problem 26J. Consider a symmetric design with pa- 
rameters (4A — 1, 2A, A). Show that the ovals of this design are the 
lines of size 3 of the complementary design. 


Notes. 

The term ‘maximal (m,k)-arc’ is often used in the literature for 
what we have called ‘perfect (m,k)-arcs’, but this is too much of 
an abuse of the word ‘maximal’ for us to tolerate. 

The combinatorial properties of quadrics and hermitian varieties 
are discussed in more detail in D. K. Ray-Chaudhuri (1962) and R. 
C. Bose and I. M. Chakravarti (1966). 

The polynomial method illustrated in the proof of Theorem 26.3 
has further uses. E.g. A. Blokhuis (1994) proved that a nontrivial 
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blocking set in PG(2,p) has at least (3p +1)/2 points, a conjecture 
of J. di Paola. The idea is to note that the blocking set S must have 
a tangent, which without loss of generality is the line at infinity. Let 
the other points be (a;, b;) in the affine plane with i = 1,...,p +k. 
A one page complicated study of the polynomial 


pt+k 
F(t,u) = [][(¢+ a; + ub)) 


i=1 
shows that k > Pit 
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27 


Difference sets and 
automorphisms 


A class of symmetric designs arises from difference sets (defined be- 
low) in abelian groups. One such design appeared in Example 19.6. 
The group reappears in the automorphism group of the design. 

In a simple incidence structure, we may identify the blocks with 
sets of points, i.e. the block set A is a family of subsets of the point 
set X. An automorphism of a symmetric design (X,A), or any 
simple incidence structure, is a permutation a of X that takes A 
to A, i. for A C X, A € A if and only if a(A) € A. We begin 
with a theorem on automorphisms of symmetric designs in general. 


Theorem 27.1. Let S = (X,A) be a symmetric (v,k, X)-design 
and @ an automorphism of S. Then the number of points fixed by 
a@ is equal to the number of blocks fixed by a. 


PROOF: Let N be the incidence matrix of S. Define a permutation 
matrix P whose rows and columns are indexed by the points and 


where 
L) “af ola) Sy, 


P(z,y) = { 


O otherwise. 

Define a permutation matrix Q whose rows and columns are in- 
dexed by the blocks and where 

1 if a(A)=B, 


0 otherwise. 


Q4,B) = { 


Note that the trace of P is equal to the number of fixed points, and 
the trace of Q is equal to the number of fixed blocks of a. 
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Now we have 


PNQ' (a, A) = S- P(a, y)N(y, B)Q(A, B) 


yEX,BEA 
= N(a(az),a(A)) = N(a, A). 


That is, PNQ' = N. Equivalently, P = NQN7!. Thus P and 
Q, being similar matrices, have the same trace and the theorem is 
proved. 


Corollary. The type of the cycle decomposition of a on the point 
set X is the same as the type of the cycle decomposition of a@ on 
the block set A. 


Proor: By Theorem 27.1, a’ has the same number of fixed points 
as fixed blocks, for each 7 = 1,2,.... 

Suppose a permutation @ has c; cycles of length 7 on some set 
S,i=1,2,...,|$|. Let f; denote the number of fixed points of (3. 


Then 
ie — > 4Ci, 
lj 
and by Mobius inversion, Theorem 10.4, 
j 
LS dw) fi 
lj 


The point is that the numbers of cycles of each length (i.e. the type 
of 3) are determined completely by the numbers of fixed points of 
the powers of (. 


Corollary. If T is a group of automorphisms of a symmetric de- 
sign, then the number of orbits of T on the point set X is the same 
as the number of orbits of T on the block set A. In particular, T is 
transitive on the points if and only if V is transitive on the blocks. 


PRooF: By Burnside’s lemma, Theorem 10.5, the number of orbits 
of a group [ of permutations of a set S is determined exactly by 
the multiset (f(a) : a € TP) where f(a) is the number of elements 
of S fixed by a. 
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Before introducing difference sets, we digress to give one theo- 
rem on orbits of automorphism groups of arbitrary 2-designs. This 
has been called Block’s Lemma; see Block (1967). This provides 
another proof of the above corollary, as the dual of a symmetric 
design is also a 2-design according to Theorem 19.9. 


Theorem 27.2. If IT is a group of automorphisms of a 2-(v, k, X) 
design with v > k, then the number of orbits of T on the point set 
X is less than or equal to the number of orbits of T on the block 


set A. 


PRooF: Let X,, Xo,...,X, be the orbits of [ on X and let A, Ao, 
...,A; be the orbits of [ on A. Define two s x t matrices C’, D 
as follows. C(i,j) is to be the number of points x € X; that are 
incident with a fixed block A € A; (this number is the same for any 
block A in A;). D(i,7) is to be the number of blocks A € A; that 
are incident with a fixed point x € X; (this number is the same for 
any point x in X;). 

Consider the s x s matrix product CD'. CD'(i,@) is the sum 
over j of the number of ordered pairs (x, A) € X; x A; where z is 
incident with A and A is incident with a fixed point y in X~. So 
CD! (i, ) = \|X;,| if i # 2, and CD'(i,i) = (r — A) + A|X;|; that 
is, 


CD! =(r—A)I + Adiag(|X1|, |Xol,-..,|Xe|) J. 


This is similar to the matrix equation (19.7). Since v > k, we have 
r > X and the matrix on the right can be seen to be nonsingular 
in several ways, e.g. by calculating its determinant. It follows that 
the rank of C' (and D) is s, and this cannot be greater than the 
number t of columns. 


Let G be an abelian group of order v. A (v,k, A)-difference set in 
G isa k-subset D C G such that each nonzero g € G occurs exactly 
A times in the multiset (x — y : x,y € D) of differences from D. 
More formally, we are requiring that the number of ordered pairs 
(x,y) with z,y € D and «—y =g is X when g ¥ 0 and this number 
is k for g = 0. Evidently, A(v — 1) = k(k — 1). 
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Example 27.1. Examples of difference sets include: 
(7,3,1) {1,2,4} in Z, 

(13,4,1) {0,1,3,9} in Zy3 

(11,5,2) {1,3,9,5,4} in Zy 

(16,6,2) {10, 20,30, 01,02,03} in Zyx Zy 
(16,6,2) {0000, 0001, 0010, 0100, 1000, 1111} in Zyx Zyx Zox Zp 


A difference set is nontrivial when 1 < k <v-—1. A difference 
set with \ = 1 is sometimes called planar or simple. 

Let G be an abelian group of order v. For S C G, g € G, we 
denote by S + g the translate, or shift, 


S+g:={xt+g:x€S} 


of S by g. Let D be a k-subset of G, and x,y € G. In general, we 
claim that the number of shifts D+ g that contain both x and y is 
equal to the number of times d := x—y occurs as a difference within 
D. This is because g +> (a—g, y—g) is a one-to-one correspondence 
between the set {g € G: {x,y} C D+ g} and the set of ordered 
pairs (a,b) of elements of D such that a —b =a -—y. (The reader 
may check that this common number is also equal to the cardinality 
of the intersection (D + x7) N (D+ ¥y).) 

In particular, (G,{D+g:g9 € G}) is a symmetric (v, k, A)-design 
if and only if D is a (v,k, A)-difference set. 


Problem 27A. A (v,k, A)-quotient set in an arbitrary group G of 
order v (written multiplicatively) is a k-subset D C G such that 
any one of the following conditions holds: 


(1) Each nonidentity element g € G occurs exactly \ times in 
the list (ry~! : x,y € D) of ‘right’ quotients from D. 

(2) Each nonidentity element g € G occurs exactly \ times in 
the list (x~!y: x,y € D) of ‘left’ quotients from D. 

(3) |DM (Dg)| =X for each nonidentity g € G. 

(4) |DM (gD)| = for each nonidentity g € G. 

(5) (G,{Dg:g € G}) is a symmetric (v, k, X)-design. 

(6) (G,{gD :g € G}) is a symmetric (v, k, X)-design. 

Show that the above six conditions are equivalent. 


Theorem 27.3. Let G be a group of order v. The existence of a 
(v,k, \)-quotient set in G is equivalent to the existence of a sym- 
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metric (v,k, A)-design that admits a group G of automorphisms 
that is isomorphic to G and regular, i.e. sharply transitive, on the 
points of the design. 


Proor: Let D bea (v,k, A)-quotient set in G. Then (G,{gD:9 € 
G}) is a symmetric (v,k,A)-design. For g € G, define a permuta- 
tion g of G by g(x) = gx. Then each g is in fact an automorphism 
of (G,{gD : g € G}) and the group G= {g: 9 € G} of automor- 
phisms is clearly isomorphic to G and regular on the points. 

Conversely, let G be given and let (X,.A) be asymmetric (v, k, X)- 
design with regular group G of automorphisms of (X, A) that is 
isomorphic to G. It will be sufficient to exhibit a (vu, k, A)-quotient 
set in G. 

Fix a point 2) € X and a block Ap € A. Let 


D:={0 €G: (29) € Ao}. 


We claim that D is a (v,k, \)-quotient set in G. Since G is regular 
and |Ao| = k, we have |D| = k. Let @ be a nonidentity element of 
G. Then aD = {ao : o(x%0) € Ao} = {7 : T(x0) € a(Ao)}, so 


DN (aD) = {7 : T(x) € Ap MN a(Apo) }. 


Now since G is regular, a has no fixed points and hence, by Theorem 
27.1, fixes no blocks. Thus the block a(Ap) is distinct from Ao, so 
|Ag Na(Ap)| = A, and by regularity, |DM(aD)| = A. This holds for 
all nonidentity elements a@ and establishes our claim. 


In particular, the existence of a cyclic (vu, k, \)-difference set, i.e. 
a difference set in Z,, is equivalent to the existence of a symmet- 
ric (v,k, A)-design that admits a cyclic automorphism, i.e. an au- 
tomorphism with cycle decomposition on the points—or blocks— 
consisting of one cycle of length v. 

We now restrict our attention to difference sets in abelian groups. 
Ideally, we would like to describe and classify all difference sets—to 
find out which groups have difference sets, how many there are, etc. 
This we shall do for various small parameter triples, but in general 
the existence problem alone is already extremely difficult. 
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Observe that D C G is a (v,k, X)-difference set if and only if 
G\ Disa(v,v—k,v—2k-+ \)-difference set. Thus we may confine 
our attention to the case k < 5U. 

Also note that D is a difference set if and only if every translate 
of D is a difference set. 

In the case that (v,k) = 1, it happens that we can choose a 
natural representative from the class of all translates; this can be a 
help in classification and will be useful in the next chapter. Call a 
subset of an abelian group G normalized in the case that the sum 
of its elements is zero. 


Proposition 27.4. Let D be a k-subset of an abelian group G of 
order v. If (v,k) =1, then D has a unique normalized translate. 


PrRooF: Let h be the sum of the elements of D. Then the sum of 
the elements of a translate D+ g ish +kg. Since (v,k) = 1, there 
is a unique group element g with h+kg = 0. 


The reader is invited to verify to his or her satisfaction that 
the normalized difference sets with parameters (v,k, A) = (7,3, 1), 
(13,4, 1), (11,5, 2), (21,5,1) are, respectively: 


{1,2,4}, {3,5,6} in Zz; 
{034,39}; {0;9, 5,6}, {0;4, 10,19},-40;7,8/11) in Zia: 
{1,3,4,5,9}, {2,6,7,8,10} in Zu; 
{7,14, 3, 6,12}, {7,14,9,15, 18} in Zor. 


(This will be easy after Theorem 28.3—see Example 28.2.) Of 
course, for (v,&) = 1, the total number of difference sets is v times 
the number of normalized difference sets, e.g. there are 52 difference 
sets with parameters (13,4, 1) in Zis. 

A final preliminary observation is that if @ is any automorphism 
of the group G, then a subset D C G is a difference set if and only 
if a(D) is a difference set. Thus from a given difference set we can 
obtain others by taking translates and by means of the symmetries 
of G. We say that difference sets D,, Dj in G are equivalent when 
there exists a € Aut(G) and g € G such that Dy = a(Di) + g. 
(Check that this is an equivalence relation.) The normalized dif- 
ference sets shown above are, for each parameter triple, equivalent; 
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indeed, each can be obtained from the others by multiplication by 
some integer relatively prime to the order of the respective group. 


Problem 27B. Show that all (16, 6, 2)-difference sets in Zy x Zz x 
Ly x Zy are equivalent. 


Problem 27C. Recall from Example 19.4 that symmetric designs 
with parameters v = 4t?, k = 2t?—t, \ = t?—t are related to regular 
Hadamard matrices. Let A and B be, respectively, (4x?, 27? — 
x, x? —x)- and (4y?, 2y? — y, y” — y)-difference sets in groups G and 
H (admit x or y = 1). Show that 


D:= (Ax (A \ B)) U((G \ A) x B) 
is a (427,227 — z,z* — z)-difference set in G x H where z = 2zy. 
(Thus if G is the direct product of m groups of order 4, then there 
is a (47,2-.4™-1 _ gm-1 4m-1 _ 9-1) difference set in G.) 


We describe several known families of difference sets. The known 
constructions all seem to involve finite fields and/or vector spaces. 
The first of these examples is essentially contained in Chapter 18 
on Hadamard matrices in the discussion of Paley matrices, but it 
is worth stating explicitly. Difference sets with parameters 


(v,k, A) = (4n — 1,2n —1,n—-1) 


are often called Hadamard difference sets. 


Theorem 27.5 (Paley, Todd). Let q = 4n—1 be a prime power. 
Then the set D of nonzero squares in F, is a (4n—1,2n—1,n—1)- 
difference set in the additive group of F,. 


PROOF: Clearly, |D| = 2n — 1. 

Since D is invariant under multiplication by elements of the set 
S of nonzero squares, the multiset M of differences from D also has 
this property. Also, M is obviously invariant under multiplication 
by —1. Since gq = 3 (mod 4), —1 ¢ S and every nonzero element of 
F, is either in S or of the form —s for some s € S. In summary, M 
is invariant under multiplication by all nonzero elements of F,, and 
so D is a (4n — 1, 2n — 1, X)-difference set for some . The relation 
A(vu — 1) = k(k — 1) forces X=n—-1. 
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We obtain difference sets {1, 2,4} in Z7, {1,3,4,5,9} in Z,, and 
{1, 4, 5, 6, 7,9, 11, 16,17} in Z9 from Theorem 27.5. The (27, 13, 6)- 
difference set later in the series will be in the elementary abelian 
group of order 27, not Zo7. 


Problem 27D. Show that the Paley—Todd difference sets are nor- 
malized if q > 3. 


Stanton and Sprott (1958) found another family of Hadamard 
difference sets. 


Theorem 27.6. If q and q+ 2 are both odd prime powers, then 
with 4n—1 := q(q+2), there exists a (4n—1, 2n—1, n—1)-difference 
set in the additive group of the ring R := F, x Fj42. 


Proor: Let U := {(a,b) € R: a 4 0,b F O} be the group of 
invertible elements of R. Let V be the subgroup of U consisting of 
those pairs (a, b) such that both a and b are squares in the respective 
fields F, and F,42 or both a and b are nonsquares. Check that V 
is an index 2 subgroup of U, and also that (—1,-1) ¢ V. Put 
T := EF, x {0}. We claim that = T UV is a difference set as 
required. We do have |D| = q+ $(q¢—1)(¢+1) = 2n-1. 

Since D is invariant under inultiphication by elements of V, the 
multiset of differences from D also has this property as well as the 
property that it is invariant under multiplication by (—1,—1). So 
the multiset of differences from D is invariant under multiplication 
by elements of all of U. Thus every element of U occurs as a 
difference the same number of times, say 1. Every element (x, 0) 
of R with x ¥ 0 will occur, say, Aj times; and every element (0, y) 
of R with y # 0 will occur, say, 3 times as a difference from D. 
Of course, we have 


(27.1)  k(kK-1) = (q—-D(q+ A+ (¢—DA2+ (94+ 1). 


It is easy to evaluate A2 and A3. Differences of the form (2,0), 
x # 0 (there are g— 1 elements of this form) arise q(q — 1) times 
from T’, never as a difference between an element of T and V, and 
(¢+1)-($(q—1))(§(q—1) — 1) times as differences of two elements 
of V; thus 


(q~1).2 = a(g-1) + (9 +1)-G@-Y)G@- 1) -1) 
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from which we conclude A = 4(q + 3)(q— 1). In a similar manner 
(Problem 27E), the reader will find that 43 = $(q¢+3)(q—1). Then 
(27.1) implies that A; = $(q + 3)(q — 1) also. 


Problem 27E. With the notation as in the proof of Theorem 27.6 
prove that 43 = 4(q + 3)(q—1). 


In the case that q and q + 2 are both primes (twin primes), 
Theorem 27.6 yields cyclic difference sets. For if s,t are relatively 
prime integers, Z, x Z; and Z,; are isomorphic as additive groups— 
and also as rings. An isomorphism Z, — Z, x Z; is provided by 
x (mod st) ++ (x (mod s),xz (mod t)) . We obtain the (15,7, 3)- 
difference set {0,5,10,1,2,4,8} in Zy5. 


Problem 27F. List the elements of a cyclic (35, 17, 8)-difference 
set. 


As a special case of Singer’s theorem, Theorem 27.7, we will 
obtain another family of Hadamard difference sets: cyclic difference 
sets with 


v=2-1, k=2 1-1, A=2°7%-1. 


Recall from Chapter 23 that the points and the hyperplanes of 
PG(n,q) form a symmetric design with 


n+l _o4 
(27.2) a 
q-1 


q-1 


Theorem 27.7. For any prime power q and positive integer n, 
there is a difference set D with parameters as in (27.2) in the cyclic 
group of order v so that the resulting symmetric design is isomor- 
phic to the points and hyperplanes of PG(n,q). 


PROOF: In view of Theorem 27.3, we need only show that there 
exists an automorphism of PG(n,q) that permutes the points in a 
single cycle of length v, or equivalently so that the powers of the au- 
tomorphism act transitively on the projective points. The points of 
PG(n,q) are the 1-dimensional subspaces of an (n + 1)-dimensional 
vector space V over F,. Any nonsingular linear transformation T 
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from V to itself will take subspaces to subspaces of the same di- 
mension and thus gives us an automorphism of PG(n, q). 
As an (n+ 1)-dimensional vector space over F,, we choose V := 


Kynti as a vector space over its subfield F,. Let w be a primitive 


element of Fj.+1 and consider the linear transformation T : 7 +> wx 


of V over F,. It is clear that Tis nonsingular and that its powers are 
transitive on the projective points (even on the nonzero vectors!). 


The difference sets constructed in the proof of Theorem 27.7 
above are called Singer difference sets—see Singer (1938). We give 
a more concrete discussion and an example. Let w be a primitive 
element of Fjn+1 and define v := (q"t! — 1)/(q— 1). The cyclic 
multiplicative group (w) of Fjn+: has a unique subgroup of order 


q —1, namely, 


(w) = {w® = 1", w,..., w?)”}. 
But the multiplicative group of the subfield F, has order q — 1, so 
we conclude F, = {0,w®, w”, w”,...,w(9-?)}. 


Now two ‘vectors’ w’ and w in Kyn+1, considered as a vector space 
over F,, represent the same 1-dimensional subspace of Fyn+i if and 
only if w' = au’ for some 0 4 a € F,, that is, if and only if i = j 
(mod v). Thus we have a one-to-one correspondence between the 
set X of 1-dimensional subspaces (projective points) and the group 
Z,, of residue classes modulo v: 


OH ty = {0,09 wr" wr?) sgt 


~2)y+1 
Leg = 10,0 wr we Re ee } 


LA SAO i at oa 


v-1 2v-1 3u-1 —1)v-1 
b= TS a= HO ot ow wee 


The map 2; +> 2;4, (subscripts modulo v) is an automorphism of 
the projective space. To obtain a difference set, let U be any n- 
dimensional subspace of Fjn+1 and let D := {i € Z,: a2; ¢ U}. A 
normalized difference set is obtained if U is taken to be the subspace 


of elements with trace zero (the trace from Fjn+:1 to F,). 
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Example 27.2. Consider n = 2, q = 5. We constrict a (31,6, 1)- 
difference set. A zero w of the polynomial ytyt+2 (eostnsionts 
in F;) is a primitive element of F533 and 1, w, w? furnish a basis for 
3 as a vector space over F;. The 124 nonzero elements of Fs3 fall 
into 31 cosets modulo the subgroup (w!) = {3,4,2,1} = EF; \ {0}, 
each coset being the nonzero elements of a 1-dimensional subspace. 
Let us take U := span{1,w} as our 2-dimensional subspace. Rep- 
resentatives of the projective points on U are 1,w,w+1,w+4+2, 
w+3,w+4 and, after some computation, 


1=v 


w=w! 
wtl=w* 
ii 2 Sa? 
wt3=w 
wt+4=% 


The resulting Singer difference set is 


{0, 1, 29,6, 18,22} in Zs. 


Problem 27G. Find a (57,8, 1)-difference set in Z57. (Some will 
probably not want to do this by hand.) 


Consider n = 3, q = 2. Here the points of PG(3, 2) are in one-to- 
one correspondence with the nonzero elements of Fy1 that in turn 
are in one-to-one correspondence with the residues modulo 15. It 
may be instructive to write out all lines and planes of this smallest 
projective 3-space so that the cyclic automorphism is clear. 


A zero w of y4+y+1 (coefficients in F; ) is a primitive element of 
Fy and w®,w*,w,1 from a basis for Fy over Fy. Any element of Fy 
can be written uniquely as a3w* +agw?+a,w+ay that we abbreviate 
as a30201a9 below. We have w4+w+1=0, or wt = 0011. We first 
construct a table of vector representations of powers of w. 
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{0,5,10} 
{1,6,11} 
{2,7,12} 
{3,8,13} 
{4,9,14} 
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PG(3, 2) 


Lines 

{0,1,4}  {0,2,8} 
{1,25} {1,3,9} 
{2,36}  {2,4,10} 
{34,7} {3,5,11} 
{4,5,8}  {4,6,12} 
{5,6,9}  {5,7,13} 
{6,7,10}  {6,8,14} 
{7,8,11}  {7,9,0} 
{8,9,12}  {8,10,1} 
{9,10,13} {9,11,2} 
{10,11,14} {10,12,3} 
{11,12,0} {11,13,4} 
{12,13,1} {12,14,5} 
{13,14,2} {13,0,6} 
414,0,3} - 414,17) 


wl = 0111 
wit = 1110 
wi? = 1111 
wi = 1101 
wit = 1001 


Planes 
{1,2,4,8,0,5,10} 
{2,3,5,9,1,6,11} 
{3,4,6,10,2,7,12} 
{4,5,7,11,3,8,13} 
{5,6,8,12,4,9,14} 
{6,7,9,13,5,10,0} 
{7,8,10,14,6,11,1} 
{8,9,11,0,7,12,2} 
{9,10,12,1,8,13,3} 
{10,11,13,2,9,14,4} 
{11,12,14,3,10,0,5} 
{12,13,0,4,11,1,6} 
{13,14,1,5,12,2.7} 
{14,0,2,6,13,3,8} 
{0,1,3,7,14,4,9} 


Note that for g = 2, the Singer difference sets have Hadamard 
parameters. We obtain (31,15, 7)-difference sets both from The- 


orem 27.5 and Theorem 27.7. These two difference sets are not 


equivalent—and not even isomorphic. (Two difference sets are iso- 
morphic when the corresponding symmetric designs are isomorphic. 
Equivalent difference sets are surely isomorphic, but the converse 
is not true.) The difference set D of quadratic residues modulo 31 
has the property that 


DA(D+1)N(D +3) = {5,8, 10, 19}. 
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The design (Z31,{D +g : g € Zs:}) cannot be isomorphic to the 
points and hyperplanes of PG(4, 2) since the intersection of flats is 
again a flat and PG(4,2) has no flats with precisely four points. 
Gordon, Mills, and Welch (1962) have shown that the construc- 
tion of Singer’s theorem can be modified in some cases to produce 
many nonequivalent difference sets with the same parameters. 


Problem 27H. Let D be an (n? +n+1,n +1, 1)-difference set 
in an abelian group G. Show that —D is an oval in the associated 
projective plane. 


Problem 271. Let G = {0,ao,a1,...,a,} be any group of order 
q+2, where q is a power of a prime. Let V be a 2-dimensional vector 
space over F, and let Up, Uj,...U, be its 1-dimensional subspaces. 
Show that 


q 
D:= fai} x U; 
i=0 
is a difference set in G x V. For example, we obtain a (45,12,3)- 
difference set. 


Notes. 

The idea of group difference sets, as a generalization of cyclic 
difference sets, is due to R. H. Bruck (1955). 

Problem 27] is a result of McFarland (1973). 

Theorem 27.2 holds also for arbirary 2-designs, simple or not. Of 
course, one must define an automporphism of an arbitrary incidence 
structures S = (P,6,1). Perhaps the most precise definition is as 
an ordered pair (a, 3), where a is a permutation of P, 3 is a permu- 
tation of B, and such that (x, A) € Lif and only if (a(x), G(A)) € I. 
The automporphisms form a group under coordinate-wise composi- 
tion. The permutations a that occur as first coordinates of a group 
T of automorphisms form a group I; of permutations of the points; 
the permutations @ that occur as second coordinates of a group [ 
of automorphisms form a group [2 of permutations of the blocks. 
By the orbits of [ on the points, respectively blocks, we mean the 
orbits of [, respectively To. 
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Difference sets and the 
group ring 


The group ring provides a natural and convenient setting for the 
study of difference sets. The existence of a difference set will be 
seen to be equivalent to the existence of a solution to a certain 
algebraic equation in a group ring. We shall use the group ring 
to derive the celebrated Multiplier Theorem of M. Hall, Jr. as well 
as number-theoretic criteria on the parameters of a difference set 
which are stronger than the Bruck-Ryser-Chowla theorem. 

Let R be a ring (commutative with one) and G a finite abelian 
group (written additively). The elements of the group ring R[G| 
are all formal sums 

A= S- Gig ht", 


geEG 
where a, € R for each g € G. (Here the symbol z is just a place 
holder. The important thing is that we have an element a, of R 
for each element g of G, i.e. the elements of the group ring are in 
one-to-one correspondence with mappings G — R.) 
We define addition and scalar multiplication in the obvious way: 


S- aga S° bya? = So (ay bg Jae; 


gEG gEeG geG 
oom g 
é ) age y (Cag) x": 
gEéG gEG 


Multiplication in R[G] is defined by 


S- aga! S- bx? := ys S- aybpe |e? 


gEG gEG geG \h+h’=g 
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With these definitions, R[G] is a commutative, associative R- 
algebra. The notation we have chosen is appropriate for abelian 
groups and emphasizes an analogy with polynomials. Note that 
when G is the additive group of the residues modulo v, then the 
group ring R[Z,] consists of all sums ee. r;x’ (exponents modulo 
v) and is isomorphic to the factor ring Riax|/(a” — 1) of the polyno- 
mial ring R[x]. In general, we shall even take to denoting elements 
of arbitrary group rings R[G] as A(x), B(ax),... when convenient. 
The element x° € R[G] is the multiplicative identity in R[G] and 
we denote x° by 1. 

We shall be concerned almost exclusively with the group rings 
Z|G] over the integers. For a subset A C G, we define A(x) € Z[G] 


by 
A(z) = So 29. 


gEA 
In particular, G(r) = Di) <q a9. For A, BCG, 
Ale) Ba): = Roe 
gEG 
where ¢, is the number of times g occurs in the multiset 
(h+h’:heA,h' € B) 
of sums of elements of A and B. 
For A(x) = di ycq 4gt? € Z[G], we write 
Ae) Sage: 
gEG 


So for a k-subset D of a group G of order v, D is a (v, k, \)-difference 
set in G if and only if the equation 


D(z)D(a7') =n + rAG(a) 


holds in the group ring Z[G], where n := k — X. 
An important homomorphism from Z/G] to Z is given by 


A(z) + A(1) = $ > ay € Z. 


gEG 
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Proposition 28.1. Let v,k,X be positive integers such that 
A(u — 1) = k(k — 1) 


and let G be an abelian group of order v. The existence of a 
(uv, k, \)-difference set in G' is equivalent to the existence of an ele- 
ment A(x) € Z|G] satisfying the equation 


(28.1) A(x) A(x") = n+ AG(z), 


where n:= k— X. 


PROOF: We have already pointed out that for a subset D of G, 
D(a) satisfies (28.1) if and only if D is a (v,k, A)-difference set. It 
remains to show that if there is a solution A(x) to (28.1), then we 
can find a solution B(x) = 5°46 x4 where the coefficients b, are 
0’s and 1’s. 

Assume that A(z) satisfies (28.1) and apply the homomorphism 
‘c++ V from Z[G] — Z. We find 


gEG 


(A(1))? =n +0 =k’, 


so A(1) = k or —k. Now if A(x) satisfies (28.1), then so does 
B(x) = —A(x), so we may assume A(1) = )o a, = k. 

The coefficient of 1 = x° in A(x)A(a~!) is k = aes a. Thus 
gcc Ag(Ag — 1) = 0. But a(a — 1) is strictly positive unless the 
integer a is 0 or 1. 


For any integer t, g > tg is a homomorphism of the group G into 
itself and induces a ring homomorphism Z[G] — Z[G], namely 


A(x) = ye age! re A(a):= ye 


géG gEG 


For A, B € Z|G] and n € Z, we say A= B (mod n) when A— B= 
nC for some C' € Z|[G]. Here is an easy lemma we will use often. 


Lemma 28.2. Let p be a prime and A € Z|G]. Then 


(A(a))? = A(x?) (mod p). 
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PROOF: We proceed by induction on the number of nonzero co- 
efficients of A(x). The lemma holds for A(x) = 0. Now if A(x) = 
cxzI + B(x) where B?(x) = B(x?) (mod p), then 


A? (ax) = (cx9 + B(ax))? = (cx9)? + B(x) 
= Px?9 + B(x) = cx?9 + B(x?) = A(z”), 


where the congruences are modulo p. 


Let G be an abelian group and D a difference set in G. An 
automorphism a of G is said to be a multiplier of D if and only if 
the difference set a(D) is in fact a translate of D, i.e. if and only 
if a(D) = D+ q for some g € G. For example, the automorphism 
x ++ 3x of Zi3 is a multiplier of the (13, 4, 1)-difference set {0,2,3,7} 
since {0,6,9,8} = {0,2,3,7} + 6. 

If t is an integer relatively prime to the order of G, then x + tx 
is an automorphism of G since the maps x +> ta and x + tox 
coincide if and only if t; = t2 (mod v*) where v* is the exponent 
of G, i.e. the least common multiple of the orders of the elements 
of G. If x +> ta is a multiplier of a difference set D in G, we say 
that t is a numerical multiplier or Hall multiplier of D. It is at first 
surprising that many difference sets (e.g. all known cyclic difference 
sets) must necessarily have a nontrivial numerical multiplier. 


Problem 28A. Let gq = p', p prime. Show that p is a multiplier 
of the Singer difference sets described in the previous chapter. 


Observe that an automorphism a of G is a multiplier of a dif- 
ference set D in G if and only if in the group ring Z[G], D(x*) = 
x! - D(x) for some g € G. 


Theorem 28.3 (Multiplier Theorem, first version). Let D 
be a (v,k, A)-difference set in an abelian group G of order v. Let p 
be a prime, p|n, (p,v) =1, p > A. Then p is a numerical multiplier 
of D. 


It will be convenient to isolate part of the proof as a lemma. 


Lemma 28.4. Let a be an automorphism of G and let D be a 
(uv, k, \)-difference set in G. Consider 


S(a) := D(x2*)D(2~') — AG(a). 
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Then a is a multiplier of D if and only if S(x) has nonnegative 
coefficients. 


PROOF: We begin by remarking that if a is a multiplier, then we 
have D(x) = x9 - D(x) for some g € G, and then 


D(x") D(a) = 29 - D(x) D(a~") = w9(n + AG(x)) = na? + AG(a). 


So in this case, S(x), as defined above, is equal to nx’ for some 
g € G. In particular, it has nonnegative coefficients. Note that, 
conversely, if D(a*)D(a~') = nz9 + \G(x), we can multiply this by 
D(a) to find 


D(a") - (n+ AG(ax)) = nx - D(x) + AD(x)G(a), 


nD(a2") + ARG(x) = nx9 - D(x) + ARG (ax). 


So D(a2*) = x9 - D(x) and a is a multiplier. 
Now ‘x +> x’ is an automorphism of Z[G], so D(a®)D(a2~°) = 
n+ AG(x), i.e. a(D) is also a difference set, and 


S(a)S(a™) = {D(x*) D(a") — AG(2) }{ D(a) D(z) — AG(a)} 
= {n+ rAG(a«)}? — 2AkK’G (x) + °vG(z) 
=n? +2d(n+ du — k’)G(a2) =n’. 


Suppose that S(x) = }/,<q S27 with nonnegative coefficients 54. 
If s, > 0 and s, > 0 for g,h € G, then the coefficient of x29~” 
in S(x)S(a~'), = n?, is at least s sp, i.e. strictly positive, and so 
x9" = 7°, i.e. g = h. So S(x) can have only one positive coefficient, 
say S(x) = 5,29. The equation $(x)S(2~!) = n? forces sy, = n and 
we have shown S$(x) = na. As noted above, we may conclude that 
qa is a multiplier. 
PROOF OF THEOREM 28.3: Let S(x) := D(x?)D(a~!) — AG(a). 
By Lemma 28.4, it will suffice to show that S(xz) has nonnegative 
coefficients. By Lemma 28.2, 


D(x”) D(a) = D?(x) D(a!) = D?\ (2) D(x) D(a") 
= D?(x) - (n+ AG(x)) = nD?“1(x) + AK? 1G (a) 
= AG(x) (mod p), 
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since p divides n and \k?-! = \P = X (mod p). Thus the co- 
efficients of D(x?)D(x~'), which are clearly nonnegative, are all 
congruent to A modulo p. Since p > X, it must be that the coeffi- 
cients of D(x?) D(a~") are greater than or equal to A, i.e. $(x) has 
nonnegative coefficients. 


Problem 28B. Find an element S$(x) in Z[Z;] with the property 
that S(x)S(271) = 4, but with S(x) 4 4229. 


Note that the hypothesis p > A was essential for our proof. Yet in 
every known difference set, every prime divisor of n (not dividing 
v) is a multiplier. It may be, then, that the hypothesis p > 
is unnecessary and by now the question of whether this is so has 
acquired the status of a classical unsolved problem. There have also 
been several generalizations of the original Multiplier Theorem, all 
of which would be trivial if we could eliminate the condition p > A, 
and we state such a generalization below after first giving some 
applications of the current version. 


Corollary. For \ = 1, every prime divisor of n, and hence every 
divisor, is a multiplier of every (n? + n+ 1,n +1, 1)-difference set. 


Example 28.1. We claim that there are no (n?+n+1,n+1,1)- 
difference sets with n = 0 (mod 6). Let D be such a hypothetical 
difference set, and without loss of generality, assume D is normal- 
ized so that it is fixed by all multipliers. Both 2 and 3 would be 
multipliers. Then for « € D, 2x and 3x also are in D. Then the 
difference x occurs twice, once as 2% — x, and once as 3x — 22; 
these are different occurrences as long as 3x 4 22, i.e. x £0. This 
contradicts A = 1. 


Problem 28C. Show that there are no (n? + + 1,n + 1,1)- 
difference sets with n divisible by any of 10, 14, 15, 21, 22, 26, 34, 
35. 


It has been possible to prove, with the Multiplier Theorem and 
other techniques, that no planar difference sets exist for n < 3600 
unless n is a prime power. But the conjecture that n must be a 
prime power remains open. 


Example 28.2. Consider a normalized (21,5, 1)-difference set D 
in Z;. By the Multiplier Theorem, 2 is a multiplier and hence 
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2D = D. Thus D must be the union of the cycles of 7 + 2x on 
Zy,. These are 


{0}, {1,2,4,8,16,11}, {3,6,12}, {5,10,20, 19,17, 13}, 
{7,14}, and {9,18,15}. 
But D has 5 elements, so 7f there is such a difference set, D must be 
{7, 14, 3,6, 12} or {7,14,9,18,15}. It turns out that both of these 


are difference sets. One consists of the negatives of the other, so 
the 42 difference sets with these parameters are all equivalent. 


Problem 28D. Find all normalized difference sets with parame- 
ters (7,3, 1), (11,5, 2), (18, 4,1), (19,9, 4), (31, 10,3), and (37, 9, 2). 
(Note: no difference sets exist for one of the parameter triples.) 


We give two more easy lemmas on the group ring at this point. 


Lemma 28.5. Let G be an abelian group of order v and p a prime, 
pt{v. Let A € Z[G| and suppose A” = 0 (mod p) for some positive 
integer m. Then A = 0 (mod p). 


PROOF: Choose a power gq = p® of p such that gq > m and q=1 
(mod v). Then surely A%(z) = 0O (mod p). But by Lemma 
28.2, Al(x) = A(x’) (mod p), so A(x?) = 0 (mod p). Since gq = 1 
(mod v), gg = g for every g € Gand A(x) = A(z’). 
Note that x9 - G(x) = G(x) in Z|G]. It follows that 


A(x)G(x) = A(1)G(a). 


For n € Zand A, B € Z/G], we say A= B (mod n,G) when A—B 
is an element of the ideal in Z[G] generated by n and G = G(x), or 
equivalently, when 


A-B=nC+mG 
for some C' € Z[G]. 


Lemma 28.6. Let G be an abelian group of order v and p a prime, 
p{v. If A € ZG] and 


"=0 (mod p, G) 
for some positive integer m, then A =0 (mod p,G). 


PROOF: Choose q = p® with gq = 1 (mod v) and q > m. Then 
A(x) = 0 (mod p,G) and A%(x) = A(x’) = A(x) (mod p). 
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Theorem 28.7 (Multiplier Theorem, second version). Let 
D bea(v,k, X)-difference set in an abelian group G' of exponent v*. 
Let t be an integer, (t,v) = 1, and suppose we can find a divisor 
m of n := k —X such that m > X and for every prime divisor p 
of m, there is an integer f for which p! = t (mod v*). Then t is a 
numerical multiplier of D. 


PROOF: The proof will use Lemmas 28.2, 28.4, 28.5 and the fol- 
lowing observation: 

Let D be a (vu, k, A)-difference set in G, a an automorphism of G, 
and put S(a#) := D(x*)D(x~!) — \G(x). Assume that a has order 
e, so that a° = identity. Then, we assert, in the group ring Z|G], 


2 


S@) Se" Sta js ae 


73) =n. 


To see this, note that for any integer 7, we have 


D(x“ )D(a-) =n +AG(z) =n (mod G) 


2 e—1 


S(Z)S(a\sla) 26 Sa?) 
= {D(2")D(x) }{D(2"") D(a} ++ (D(x) D(a* 


= {D(2)D(e){D(2")D(a)} + (D(a) D(e")} 


Thus S(x)$(x%)---S(x*) = n° + G(x) for some integer ¢. But 
S(1) = (D(1))? — AG(1) = k? — Av = n; so applying the homomor- 
phism x +> 1, we find n° = n* + év, and hence ¢ = 0. 

To continue with the proof, let S(x) := D(a#')D(a~') — AG(z). 
By Lemma 28.4, to show t is a multiplier, it will be sufficient to 
prove that S(xz) has nonnegative coefficients. Each coefficient of 
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S(ax) is at least —A in value and since m > A, the nonnegativity 
of the coefficients will follow if we can establish that S(x) = 0 
(mod m). To establish this, it will suffice to show that S(x) = 0 
(mod p’) whenever p is prime and p’ divides m. This we do below. 

Let e be the order of t modulo v*, so that t*° = 1 (mod v*). As 
shown above, 


S(x)S(x")S(x")---S(x") = n°. 


Let p be a prime divisor of m and let f be such that pf = t 
(mod v*). Then 


S(x)S(a?!)S(a?*)--- S(aPY) = ne. 


Let p’ be the highest power of p dividing n and let p’ be the 
highest power of p dividing (all coefficients of) S(x), so S(#) = 
p/T(x) where T(x)40 (mod p). Then 


pT (x) p'T (x) -- pT (x) = n°, 
from which it follows that p’ divides n (so j < i) and 


ys 


n 


Pty (gre) = 
T(x)T (a? )--- TC FN 


Suppose that 7 < i, so that om is divisible by p. Then 
T(2?") he Tg”) 

T?’ (x) ++» T(a)T? (a) 

= Gn ae cic aa (mod p). 


But then, by Lemma 28.5, Ta) = 0 (mod p), contradicting the 
choice of 7. Thus 7 = 7. 


Corollary. If n = p*, p prime, (p,v) = 1, then p is a numerical 
multiplier of every (v, k, X)-difference set. 


Proor: A difference set D and its complement G \ D have the 
same multipliers. Thus we may assume that k < 5U and hence 
n> A. In Theorem 28.7, take m =n, t = p. 
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Example 28.3. Consider a hypothetical (25,9,3)-difference set. 
Take t = 2 and m = 6 in the above theorem. Since 3°? = 2 
(mod 25), we may conclude that 2 is a multiplier. 


Problem 28E. Find all normalized difference sets with parame- 
ters (15,7,3), (25,9, 3), (39,19,9), (43, 15,5), and (61,16,4). De- 
termine, for each parameter set, whether these normalized differ- 
ence sets are equivalent. (Note: difference sets do not exist for most 
parameter sets.) 


Problem 28F. Let D bea nontrivial (v, k, A)-difference set. Prove 
that if —1 is a multiplier of D, then v is even. 


Problem 28G. Find a (36,15,6)-difference set in Zg x Ze with —1 
as a multiplier. 


Suppose D is a (v,k, A)-difference set in an abelian group G of 
even order v. By Theorem 19.11(i), we know that n is a square. 
But in the case of difference sets, we can say what n is the square 
of! Let A be any subgroup of G of index 2. Say D contains a 
elements of A and b elements of B := G \ A. Since every element 
of B occurs A times as a difference from D, and only differences of 
one element in A and one in B lie in B, 2ab = 5Av. From this and 
a+b=k, we find that (a —b)? =n. 

Now suppose v is divisible by 3, let A be a subgroup of G of 
index 3, and let B and C be the cosets of A in G. Say D contains a 
elements of A, b elements of B, and c elements of C. The number 
of differences from D which lie in B, say, is ba + cb+ ac, which on 
the other hand must be 5Av. From this anda+b+c=k, we can 
check that 4n = (b +c — 2a)? + 3(b — c)?. Now it is not true that 
every integer can be written as the sum of a square and three times 
another square, so this condition rules out the existence of certain 
difference sets. For example, there are no (39, 19, 9)-difference sets, 
even though there are symmetric designs with these parameters, 
because 4n = 40 cannot be written as above. 

We generalize the above necessary conditions on the parameters 
of difference sets by considering homomorphisms of the group ring. 
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If ~ is a homomorphism G — H, then a induces a ring homomor- 
phism 7. — ZH}: 


= age! re A(z =) aa [H]. 


Theorem 28.8. Let D be a (v,k, ae set in an abelian 
group G. Let u > 1 be a divisor of v. If p is a prime divisor of n 
and 

p! =-1 (mod u) 
for some integer f, then p does not divide the squarefree part of n. 


PROOF: We prove something stronger. Let a: G — H be a homo- 
morphism onto a group H of order u and exponent u*, say. Suppose 
p is prime and pf = —1 (mod u*) for some f. We will show that n 
is exactly divisible by an even power p*! of p and that for this j, 


D(x*)=0 (mod p’,H) in Z[H]. 
In other words, all the coefficients of D(x®), which are the numbers 
of elements of D belonging to the various cosets of the kernel of a, 
are congruent to each other modulo p’. 
Let p’ be the highest power of p which divides n. Our hypothesis 
p! = —1 (mod u*) means that D(x~°) = D(x?’“) in Z[H] (where 
we will do our calculations), so we have 


(28.2) D(a°)D(2?’%) = 
D(x*)D(a-*) =n + A-H(2) =0 (mod p’, H). 


Let p’ be the highest power of p so that D(x°) = 0 (mod p’, H); 
say D(x“) = p’ A(x) (mod H). The reader may check that (28.2) 
implies that 27 < i and that if 27 < i, then A(x)A(x?’) = 0 
(mod p,H). But then Lemmas 28.2 and 28.6 imply in turn that 
A(x)!+?’ = 0 (mod p,H) and then A(x) = 0 (mod p, H). This 
contradicts the choice of 7, so 27 = 1%. 


Consequences of Theorem 28.8 are: if v is divisible by 3, then 
all prime divisors of the squarefree part of n are congruent to 0 
or 1 (modulo 3); if v is divisible by 5, then all prime divisors of 
the squarefree part of n are congruent to 0 or 1 (modulo 5); if uv 
is divisible by 7, then all prime divisors of the squarefree part of n 
are congruent to 0, 1, 2, or 4 (modulo 7). 
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Example 28.4. We give an application of the stronger claim given 
in the proof of Theorem 28.8. Consider a hypothetical (154, 18, 2)- 
difference set D in G. Let a: G — H be a homomorphism onto a 
group H of order u := 11. Take p := 2. Since 2° = —1 (mod 11), 
we conclude that D(x°) = 0 (mod 4,H). This means the 11 co- 
efficients of D(«®) are all congruent to some integer h modulo 4; 
since the 11 coefficients sum to 18, h is 2. But then the coefficients 
would sum to at least 22 and this contradiction shows that no such 
difference sets exist. 


Problem 28H. Let D bea (q/+q?+1, ¢°+1, 1)-difference set in an 
abelian group G. Let a: G — H bea homomorphism onto a group 
FH of order u := q?—q+1. Show that any coset of the kernel of a is 
the point set of a Baer subplane of the projective plane of order q? 
arising from D. For example, the 21 points of the projective plane 
of order 4 are partitioned into three Fano configurations. 


Problem 281. Let D be a(qg+q’?+q4+1,¢+¢+1,q+1)-difference 
set in an abelian group G. Let a: G — H be a homomorphism 
onto a group H of order u := q+ 1. Show that the translates of D 
meet the cosets of the kernel of a in g+ 1 points or a single point. 
Show further that if the symmetric design arising from D consists 
of the points and planes of PG3(q), then the cosets are ovoids (see 
Example 26.5). 


Notes. 

The celebrated Multiplier Theorem was first proved in the case 
of cyclic planar difference sets in Hall (1947). It was generalized to 
\ > 1 by Hall and Ryser (1951) and since then has been extended 
in many ways—see e.g. Mann (1965), Baumert (1971), and Lander 
(1983). Algebraic number theory and characters of abelian groups 
often play a role in the proofs of these results, but we have chosen 
here to give proofs of Theorem 28.7 and 28.8 using only the group 
ring. 

Marshall Hall (1910-1990) did fundamental work in group theory 
and coding theory as well as on combinatorial designs. He has 
been a tremendous influence on many mathematicians, including 
the authors. 
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Theorem 28.8 is due to K. Yamamoto (1963). 
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Codes and symmetric designs 


In this chapter, we elaborate on some of the material introduced in 
Chapter 20. We saw in that chapter that the rows of the incidence 
matrix of a projective plane of order n = 2 (mod 4) span a binary 
code which, when extended, is self-dual. It was observed by H. A. 
Wilbrink (1985) and others that that result can be used to show 
that planar difference sets do not exist for values of n > 2 with 
n = 2 (mod 4); see Theorem 29.7. 

One can consider the code spanned by the rows of the incidence 
matrix over other prime fields F, as well. With essentially the same 
proof as that of Theorem 20.6, we have the following theorem. 


Theorem 29.1. Ifp divides n := k— 4, then the F,-span C' of the 
rows of the incidence matrix N of a symmetric (v, k, )-design has 
dimension at most (v + 1)/2 over F,. If (p,k) = 1 and p” does not 
divide n, then the dimension of this p-ary code is exactly (v+1)/2. 


Problem 29A. Prove Theorem 29.1. 


It is not possible, in general, to extend the code C’ to get a code 
of length v + 1 that is self-orthogonal with respect to the standard 
dot product on F’. But let us consider other ‘scalar products’ for 
odd primes p. 

To anonsingular m x m matrix B over a field F, we may associate 
the scalar product (or bilinear form) 


(x,y) == xBy" 


for x,y € F”. For a subspace C' of F’”, let 


C® := {x: (x,y) =0 for all y € C}. 
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Then C and C® have complementary dimensions and (C?)? = 
C. We say that C is totally isotropic when C C C®; this is the 
appropriate terminology for this generalization of self-orthogonal. 
We have use for the following theorem of Witt, and we include a 
proof here as it is essentially a rephrasing of part of Theorem 26.6. 


Theorem 29.2. Given a symmetric nonsingular matrix B over a 
field F of odd characteristic, there exists a totally isotropic subspace 
of dimension m/2 in F™ if and only if (—1)"/? det(B) is a square 


ni 


in F. 


Proor: A totally isotropic subspace of dimension m/2 is, when 
considered as a set of projective points in PG(m — 1,2), a flat of 
projective dimension (m/2) —1 that is contained completely in the 
quadric Q defined by the nondegenerate quadratic form 


f(x) =xBx'. 


By Theorem 26.6, such a subspace exists if and only if Q is hyper- 
bolic. It remains only to verify that Q is hyperbolic if and only if 
(—1)/? det(B) is a square in F. 

If f, is a form projectively equivalent to f, then f;(x) = xB,x" 
where B; = UBU' for some nonsingular matrix U. Of course, 
(—1)'"/? det(B) is a square if and only if (—1)"”? det(B;) is a square. 

If Q is hyperbolic, then f is equivalent to fj = x1%9+---+2%m—-1%m 
and B, is block diagonal with m/2 matrices 


on the diagonal. Then (—1)'/? det(B,) = 1/2”, which is a square. 
If Q is elliptic, then f is equivalent to f, = x1 %o+-+-+2%m—3%m—2+ 
P(@m—1,2m) Where p(z,y) = ax? + 2bry + cy? is an irreducible 
quadratic form over F, and here B, is block diagonal with (m/2)—1 
matrices W and one matrix 
@? .D 
co 


on the diagonal. Then (—1)'/? det(B,) = (b? — ac)/2™~?, and this 
is a nonsquare in F since p is irreducible. 
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It will be convenient to retain the term self-dual for a totally 
isotropic subspace of dimension equal to half its length. E. S$. Lan- 
der (1983) has shown how to associate a family of p-ary codes of 
length v + 1 and a scalar product to a symmetric design in such a 
way that one of the codes is self-dual when n is exactly divisible 
by an odd power of p. Theorem 29.2 then gives us a condition on 
the parameters of the symmetric design. It turns out that these 
conditions are already consequences of the Bruck—Ryser—Chowla 
theorem, Theorem 19.11. In some sense, then, these self-dual codes 
provide a ‘combinatorial interpretation’ of part of the BRC theo- 
rem. In any case, these codes carry information about the design 
and have further applications in the theory of symmetric designs— 
see Lander (1983). 


Theorem 29.3. Suppose there exists a symmetric (v, k, \)-design 
where n is exactly divisible by an odd power of a prime p. Write 
n = p!ng (f odd) and X = p> with (no,p) = (Ao,p) = 1. Then 
there exists a self-dual p-ary code of length v + 1 with respect to 
the scalar product corresponding to 


_ { diagQ,1)-.<,1,—Ao) if b-is-even, 
~ | diag(1,1,...,1,n0A0) if b is odd. 
Hence from Theorem 29.2, 
—(—1)*)/?)o is a square (mod p) _ if b is even, 
(—1)tY) noo is a square (mod p) _ if b is odd. 
Towards the proof of Theorem 29.3, we first prove two proposi- 
tions. 
Given any integral m x m matrix A, we may consider the Z- 
module M(A) consisting of all integral linear combinations of its 


rows; that is, 
M(A):={yA:yeZ™}. 


Fix a prime p and for any positive integer 7 define modules 
M; := {x € Z™: p'x € M(A)}, 


N;:= {y € Z™: Ay’ =0 (mod p't?)}. 
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We have Mp = M(A), M; C Mj+; and N; D Nj+1 for all i. Let 
(29.1) C;:= M; (mod p), D;:=N; (mod p). 


That is, read all the integer vectors in M; or N; modulo p to obtain 
C;, or D;. Then each C; and D; is a subspace of the vector space 
Fy”, i.e. a p-ary linear code. Clearly, 


OCC, COC... and Dp DD, DDz.... 
Proposition 29.4. We have 
CSD; 
for all nonnegative integers 1. 


Proor: Let x and y be integral vectors such that x (mod p) € C; 
and y (mod p) € D;. This means 


p'(x+pa)=zA and A(y+pb)'=0 (mod p’*') 
for some integral vectors a, b, and z. Then 


p'(x + pa)-(y + pb)' =zA(y+pb)' =0 (mod p*') 


which implies that x-y' = 0 over F,. 

We complete the proof by showing that C; and D; have dimen- 
sions which add to m. There exist unimodular matrices E’ and 
F (integral matrices with integral inverses) such that S := EAF 
is diagonal with integral diagonal entries d1, d2,...,d, which suc- 
cessively divide one another: dj|d2|...|dm. (S is the Smith normal 
form of A and the d;’s are the invariant factors.) The reader should 
verify that the modules M; and N; are equivalent to 


M! := {x:p'x € M(S)} and 


Nj :={y:Sy' =0 (mod p'*)}, 
respectively, in the sense that either can be obtained from the other 
by application of a unimodular transformation. Hence the dimen- 
sions over F, of M; and Mj are equal, as well as the dimensions of 
N; and Nj. 
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Suppose p’*! does not divide d,,...,d; but that p’t! does divide 
diyi,---,dm. Then y € N} implies that the first t coordinates of y 
are divisible by p; any vector y with 0’s in the first t coordinates 
is in N/. Thus N/ (mod p) is the span of the last m — t standard 
basis vectors and has dimension m — t. Also, x € M} implies that 
the last d — t coordinates of x are divisible by p and a little more 
thought shows that M/ (mod p) is the span of the first t standard 
basis vectors and has dimension ft. 


Proposition 29.5. Let A, B, and U bem x m integral matrices 
with 


(29.2) ABA' =U, 


and where U and B are nonsingular modulo a prime p. Write 
n = p°no where (p,no) = 1. Define the sequence C; of p-ary codes 


from A as in (29.1). Then C. = Fy” and 


CP =C.-;-1 fori=0,1,...,e—-1. 


a 


In particular, if e is odd, then C1 (c-1) is a self-dual p-ary code with 


ym 


respect to the scalar product given by B on 


Proor: Let x and y be integral vectors such that x (mod p) € C; 
and y (mod p) € C,_;_;. This means 


ely + paz) = 2A 


p'(x+pa\)=2,A and p 
for some integral vectors Z1, Z2, a;, and ag. Then 


p’\ (x,y) =p 'xBy' =z,ABA'z.' =0 (mod p*) 


in view of (29.2). Thus (x,y) =0 in F, and we see C._;-1 C CP. 
Now let x € C?. This means xB € C+, which is D; by Propo- 


sition 29.4, and so for some integral vector x’ which reduces to x 
when read modulo p, 


x’/BA'=0 (mod p't'), 
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From (29.2), A: BA'U~! = nl, and since a matrix commutes with 
its inverse, 


(29.3) BAI Ute AS nT. 


Since U is nonsingular modulo p, dU~' is integral for some d prime 
to p, e.g. d:= det(U). We multiply (29.3) on the left by dx’ to get 


x’BA'(dU~')A = p*dnox’, 


and then . 
ZA = p@'!dnox’ 


where z := —4;x’BA'(dU~) is integral. This means p**!dnox’ 
D 
is in M,._;-; and hence x € Cy_;-1. 


The assertion that C. = F)" is left as an easy problem. 


Problem 29B. Prove that C. = ee 


PROOF OF THEOREM 29.3: Let N be the incidence matrix of a 
symmetric (v, k, A)-design and let p be a prime. Assume \ = p”*Xo 
where (Xo, p) = 1 and a > 0; we will explain later what to do when 
X is exactly divisible by an odd power of p. Let 


(29.4) 
p° 1 0 
Ass ry (|, Bes 
o° 1 
p* Xo a p* Xo k 0 — Xo 


The reader should verify, using the properties of N and the relation 
A(v — 1) =k(k-1), that ABA' = nB. 

In the case A is exactly divisible by an even power of p, we apply 
Proposition 29.5 with the matrices A and B as in (29.4), and where 
Soe 

If \ is exactly divisible by an odd power of p, we apply the above 
case to the complement of the given symmetric design, which is a 
symmetric (v, v—k, \’)-design where \’ = v—2k+.. Say X’ = p°Ap 
where (\p,p) = 1. From AX’ = n(n — 1), it follows that c is odd 
and that 

Apr = No(n — 1) =—no (mod p). 
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We have replaced what would be —Xj in the conclusion by Agno, 


which is allowed since they differ by a square factor modulo p, in 
order to express the result in terms of the original parameters. 


The following theorem is a consequence of Problem 19M, but we 
give a proof similar to that of Theorem 29.3. 


Theorem 29.6. If there exists a conference matrix of order n = 2 
(mod 4), then no prime p = 3 (mod 4) can divide the square-free 
part of n — 1. 


PRooF: A conference matrix of order n is, in particular, an integral 
matrix A such that AA’ = (n —1)I,. By Proposition 29.5 with 
A = U = I, there exists a self-dual p-ary code (self-dual with 
respect to the standard inner product) of length n for every prime 
divisor p of the square-free part of n—1. Theorem 29.2 then implies 
that every such prime is = 1 (mod 4). 


OK OK 


When p is a prime not dividing the order v of an abelian group 
G, the group ring F, [G] is a semi-simple algebra over F, = Z,. This 
means that there exist no nonzero nilpotents, i.e. nonzero elements 
a such a™ = 0 for some positive integer m. The nonexistence 
of nonzero nilpotents is proved in Lemma 28.5. By Wedderburn’s 
theorem, every finite-dimensional commutative semi-simple algebra 
with identity A over a field F' is isomorphic to the direct product 
of fields, each an extension of F’. It follows that each ideal Z of A 
is principal and is generated by an idempotent e, i.e. an element e 
with e? = e. See any advanced text on algebra for proofs. 

We do not need all of this information, but here are some facts 
about principal ideals which are generated by idempotents that we 
will need. These make good exercises. 

First, if Z = (e,) and also J = (e), where both e; and e2 are 
idempotent, then e; = e2. Suppose Z; = (e;) and Z2 = (e2), where 
€; and e2 are idempotent. Then 


TNL. = (e1€2) and 7Z,+2.= (e1 + e2 — €1€2). 


Note that e,e2 and e; + e€2 — e,e2 are again idempotents. 
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Theorem 29.7. Let D be an (n*+n+1,n+1, 1)-difference set in 
an abelian group G of order v :=n?+n+1. Ifn =0 (mod 2) but 
n #0 (mod 4), then n = 2. Ifn =0 (mod 3) but n #0 (mod 9), 
then n = 3. 


ProoF: Let D bea (n?+n+1,n+1, 1)-difference set in an abelian 
group G and let p be a prime divisor of n. By Theorem 28.3, p is 
a multiplier of D and we will assume from now on that D is fixed 
by p. We work in the F,-algebra F, |G]. 

Let Z; be the ideal in F,[G] generated by D(x) and TZ, the ideal 
generated by D(x7'). 

We have D(x?) = D(x), but D?(x) = D(a?) by Lemma 28.2, so 
D?(x) = D(x) in F,[G]. Then D?~'(z) is idempotent and will also 
generate Z;. Similarly, D?~!(2~') is an idempotent generator for 
T,. So the idempotent generator of Z, M Z2 is 


D?"(a)D?"(a~") = (n+ G(x)P* = G(a), 
and the idempotent generator of Z; + Z» is 


D?"!(2) + D?“!(a7!) — G(2). 


We now wish to consider the dimensions of Z; and Z» over F,. 
In general, the rank of a principal ideal generated by A(z) is the 
rank of the v x v matrix whose rows are the coefficients of 79 A(z), 
g € G. This matrix is the incidence matrix of a symmetric (n? + 
n+1,n + 1,1)-design when A(x) = D(x) or D(x). If we now 
assume that p” does not divide n, then by Theorem 29.1, Z, and Z2 
have dimension (v + 1)/2. The intersection Z, MZ has dimension 
1, so their sum must have dimension v. The idempotent generator 
of the whole group ring, as an ideal in itself, is 1; so we conclude 
that 


(29.5) D?-l (a) + D(a) —G(a) =1 in F,[G]. 


We are only able to exploit the above equation when p = 2 or 3. 
When p = 2, (29.5) asserts D(x) + D(a!) = 1+ G(x) (mod 2) in 
Z[G]. The number of odd coefficients of 1+ G(x) isv—1=n? +n, 
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but the number of odd coefficients of D(2)+ D(a~') cannot exceed 
2(n + 1); it follows that n < 2. 

When p = 3, (29.5) asserts D?(2) + D?(x~!) = 1+ G(x) (mod 3) 
in Z[G]. We claim that the nonzero coefficients of D?(x) and 
D?(x~') consist of n + 1 coefficients that are 1’s and (05>) coef- 
ficients that are 2’s. If C is any planar difference set, there will 
be a term x79 in C?(x) for each g € C and a term 229*" for each 
unordered pair {g,h} C C; note that A’ = 1 implies, e.g., that 
gi thi € got he unless {g1,h1} = {g2,h2}. The sum of two such 
group ring elements cannot have more than ara + 2(n + 1) coef- 
ficients that are = 1 (mod 3), but 1+ G(x) has n? + n coefficients 


that are = 1; it follows that n < 4. 


Problem 29C. Suppose D is a difference set with n = 2 (mod 4) 
and that 2 is a multiplier of D. What are the parameters of D, as 
functions of n? 


Notes. 

Some of the material preceding the proof of Theorem 29.3 is more 
elegant when p-adic numbers are introduced as in Lander (1983), 
but we have chosen to present the material without them. 
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30 


Association schemes 


Given two k-subsets A,B of an n-set, n > 2k, there are k +1 
possible relations between them: they may be equal, they may 
intersect in k — 1 elements, they may intersect in k — 2 elements, 
..., or they may be disjoint. 

Given two words (k-tuples) a,b € A’, where A is an ‘alphabet’ of 
size at least 2, there are k+1 possible relations between them: they 
may be equal, they may agree in k — 1 coordinates, they may agree 
in k — 2 coordinates, ..., or they may disagree in all coordinates. 

These instances of a set together with a list of mutually exclu- 
sive and exhaustive binary relations are examples of association 
schemes, which we define shortly. Association schemes provide one 
of the foundations of combinatorics and so we include this chapter 
even though it will be difficult reading. They have been implicit in 
many of the previous chapters; we have explicitly discussed 2-class 
association schemes, as they are equivalent to the strongly regular 
graphs discussed in Chapter 21. This chapter elaborates on some 
of the material of Chapter 21 but has different goals. 

Association schemes arose first in the statistical theory of design 
of experiments, but the work of Ph. Delsarte (1973) has shown how 
they serve to unify many aspects of our subject. In particular, cer- 
tain results of coding theory and the theory of t-designs—which 
were originally discovered independently—are now seen to be ‘for- 
mally dual’ aspects of the same ideas in association schemes. For 
example, Fisher’s inequality and its generalization, Theorem 19.8, 
is formally dual to the sphere packing bound, Theorem 21.1. We 
use the machinery of association schemes in this chapter to give 
proofs of Lloyd’s theorem on perfect codes and its formal dual theo- 
rem for tight designs and orthogonal arrays. Delsarte’s inequalities, 
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Theorem 30.3, on the distribution vector of a subset of an associ- 
ation scheme provide a ‘linear programming bound’ on the size of 
codes and are also of interest in extremal set theory. 

By a binary relation on a set X, we mean a subset of X x X. A 
k-class association scheme, sometimes we say just scheme, on a set 
X of points consists of k + 1 nonempty symmetric binary relations 
Ro, Ri,..., Ry on X which partition X x X, where Ro = {(x, 2) : 
x € X} is the identity relation, and such that for some nonnegative 
integers pi, 0 < £,1,7 < k, the following system of axioms holds: 
given any (x,y) € Rp, there are exactly Pi; elements z € X such 
that (x, z) € R; and (z,y) € R;. We say x,y € & are i-th associates 
when (z,y) € R;. 

The numbers Di, 0 < £,i1,7 < k, are the parameters of the 
scheme. That p?, exists means that there is a constant number 
of i-th associates of any element of X, which is usually denoted by 
n;. We have 


(30.1) py, =n; and Py, =) tors, 4 


and 
no =1, Nnotnyt--- tn, = N. 


where N := |X|. The numbers no,71,...,% are called the degrees 
of the scheme. 


Example 30.1. The Johnson schemes J(v,k). The points of the 
scheme J(v,k) are the (,) k-subsets of a v-set S. Two k-subsets 
A, B are declared to be i-th associates when |AN B| = k —7. Thus 
0-th associates are equal. The parameters pi, exist ‘by symmetry’ 
and may be expressed as sums of products of binomial coefficients, 
but we will not bother to write these out in general. 

The scheme J(6,3), for example, is a 3-class scheme with 20 
points. The reader should check that ny = ng = 9, n3 = 1. A few 
of the other parameters are pj, = 4, p}, = 4, p;, = 0. 


Example 30.2. The Hamming schemes H(n,q). The points of 
H(n,q) are the q” words of length n over an alphabet of size q. 
Two n-tuples x,y are declared to be 7-th associates when they dis- 
agree in exactly 2 coordinates. Thus 0-th associates are equal. The 
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parameters Pi; exist ‘by symmetry’ and may be expressed as sums 
of products of binomial coefficients and powers of qg—1, but we will 
not bother to write these out in general. 

The scheme (5,3), for example, is a 5-class scheme with 125 
points. The reader should check that nj = 5-2, ng = 10-4, etc. 


Each of the relations R; may be thought of as the adjacency rela- 
tion of a graph G; on the vertex set X. (A scheme is a special kind 
of partition of the edges—or coloring of the edges—of a complete 
graph.) It should be clear that if we start with a 2-class scheme, 
then G is a strongly regular graph with degree ni, \ = pj), and 
u = pi;. In fact, any strongly regular graph gives rise to a 2- 
class association scheme when we declare that two distinct vertices 
are Ist associates when they are adjacent in G and 2nd associates 
if not adjacent; the other parameters exist, i.e. are constant, and 
can be computed from the parameters of the graph. For example, 
Pig = k-A-1, 

A distance regular graph is a graph G such that the number of 
vertices at distance 7 to x and distance j to y depends only on the 
distance @ between the vertices x and y, not on the particular ver- 
tices. That is, defining two vertices to be 7-th associates if and only 
if their distance in G is 7 produces an association scheme (where the 
number of classes will be the diameter of the graph). See Brouwer, 
Cohen, and Neumaier (1989). Schemes which arise in this way are 
called metric. The schemes mentioned in Examples 30.1 and 30.2 
above are metric, as are those in Examples 30.3 and 30.4 below. 

It is important to know that the parameters pi, exist; it is not so 
important to know their exact values. This is lucky because there is 
often no convenient expression for these parameters. In the scheme 
J(v,k), for example, a triple sum of binomial coefficients seems to 
be required for the general pi. In the examples we have just given, 
we know that the pi,’s exist because of ‘symmetry’. More precisely, 
in each case there exists a group G of permutations of X so that two 
ordered pairs of points (x1, y1) and (x2, y2) are in the same relation 
R; if and only if there exists 0 € G such that o(a71) = x2 and 
o(y1) = yo. That is, the relations Ro, R,,..., Ry are the orbits of G 
on X x X (with Ro the trivial orbit of all (z,x),2 € X). In Example 
30.1, G is the symmetric group S, acting on all k-subsets of a v-set; 
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we can find a permutation which takes an ordered pair of k-subsets 
to another if and only if the size of the intersection of each pair is 
the same. In Example 30.2, G is the wreath product of S, with S;, 
(that is, we allow any permutation of the n coordinates followed by 
independent permutations of the g symbols in each coordinate); we 
can find such a transformation which takes an ordered pair of n- 
tuples to another if and only if the pairs agree in the same number 
of coordinates. 

In general, if G is a transitive group of permutations on a set X 
such that the orbits of G on X x X are symmetric, they may be 
taken as the relations of an association scheme on X. Our next 
three examples also arise in this way. 


Example 30.3. This is the g-analogue of the Johnson scheme: 
Take the k-subspaces of a v-space V over F, as points. Two k- 
subspaces A,B are declared to be i-th associates when dim(A NM 
B)=k-i. 


ni 


Example 30.4. Take the k x m matrices over F, as the points, 
where k < m, say. Two matrices A,B are declared to be i-th 
associates when the rank of A — B is k — i. 

To put this example into the framework above, we let X be the 
set of all k x m matrices and we may take G to be the set of all 
permutations X ++ UXW +C as U ranges over the nonsingular 
k x k natrices, W over the nonsingular m x m matrices, and C’ 
over the k x m matrices. If (A,B) and (A’, B’) are pairs of k x 
m matrices such that the ranks of A — B and A’ — B’ are equal, 
then U(A — B)W = A’ — B’ for some nonsingular U and W, and 
then X + UXW + (B’ —UBW) will map the first pair onto the 
second. 


Example 30.5. The cyclotomic schemes are obtained as follows. 
Let q be a prime power and k& a divisor of gq—1. Let C, be 
the subgroup of the multiplicative group of F, of index k, and let 
C1, Co,...,C, be the cosets of C;. The points of the scheme are to 
be the elements of F,, and two points x,y are declared to be i-th 
associates when x — y € C; (and 0-th associates when x — y = 0). 
In order to have a scheme with the above definition, we require 


—1 €C; so that the relations will be symmetric, i.e. 2k must di- 
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vide q — 1 if g is odd. Cf. Example 21.3 which is the case k = 2. 


We introduce the association matrices Ag, Ai,..., Ax (also called 
the adjacency matrices) of an association scheme. These matrices 
are square with both rows and columns indexed by the elements of 
the point set X of a scheme. For 7 = 0,1,...,k, we define 


1 if (x,y) € R,, 
0 otherwise. 


Ai(a,y) = { 


The matrices A; are symmetric (0, 1)-matrices and 
Apg=I, AptAit---+Ap=J 


where J is the all-one matrix of size N x N. We denote by 2 
the linear span over the reals of Ap, Aj,...,A,;. These matrices 
are linearly independent since each contains at least one 1; and a 
position in which A; has a 1 contains a 0 in every other association 
matrix. The axioms of an association scheme are exactly what is 
required to ensure that 2 is closed under matrix multiplication. To 
see this, it suffices to show that the product of any two of the basis 
matrices is in 2( and, in fact, we have 


k 
(30.2) A;A; = Spi; Ac 
£=0 


because A;A;(x,y) is the number of z such that A;(x,z) = 1 and 
A,(z,y) = 1, and this number is pi, where ¢ is such that A;(x, y) = 
1. 

The algebra 2 is called the Bose—Mesner algebra of the scheme; 
this algebra was introduced for strongly regular graphs in Chapter 
21. We note at this point that not only is 21 closed under normal 
matrix multiplication, but it is also closed under Hadamard mul- 
tiplication as introduced in Problem 21E. The Hadamard product 
Ao B of two matrices is the matrix obtained by coordinate-wise 
multiplication: 


(Ao B)(x,y) = A(z, y) B(x, y). 
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As an algebra with respect to Hadamard multiplication, 2 is 
almost trivial. We have 


Hea A; ifi=j, 
. (e) = — 
a O ifiAgj 


(that is, Ag, Ai,..., Ax are orthogonal idempotents), and the sum of 
the A;’s is J, the identity with respect to Hadamard multiplication. 
So Hadamard multiplication is extremely simple when matrices in 
21 are expressed with respect to the basis Ap, Ai,..., Ax of 2. 

However, a well known result of matrix theory (an extension 
of the spectral theorem which says that a symmetric real matrix 
has an orthogonal basis of eigenvectors) asserts that a commuta- 
tive algebra of real symmetric matrices has a basis of orthogonal 
idempotents with respect to ordinary matrix multiplication which 
sum to the identity. More geometrically, there exists an orthogonal 
decomposition 


RF ~YHOWUO-:- OV, 


of the Euclidean space R*, the space of all vectors whose coordi- 
nates are indexed by the elements of X with the standard inner 
product, such that the orthogonal projections Eo, F),...,£, from 
R* onto the subspaces Vo, Vi, ..., Vs, respectively, are a basis for 
21. We have 


ng, fm fii, 
“FLO. ift Fs, 


By dbp hee Bee, 


and 


Of course, when matrices are expressed with respect to the basis 
Eo, F,,..., Ex, ordinary multiplication is also extremely simple. 
The subspaces Vo, Vi,...,Vz% are called the eigenspaces of the 
scheme: in a linear combination M = eet A; E;, each vector in 
V; is an eigenvector of value A; for M. There is no natural number- 
ing of the eigenspaces in general with one exception: since J € Al, 
and J has the vector j of all 1’s as an eigenvector of value N and all 
vectors orthogonal to j as eigenvectors of value 0, it must be that 
one of the eigenspaces consists of scalar multiples of j alone—we 
shall always assume that this is Vo. Then the orthogonal projection 
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onto Vo (which has j as an eigenvector of value 1 and all vectors 
orthogonal to j as eigenvectors of value 0) is 


1 
[en p 
ae 


We let m; denote the dimension of V;. Then 
mo = 1, Mo tm, +--- +m, = N. 


Note that m; is the trace of E; since the eigenvalues of E; are 1 
(with multiplicity equal to the dimension of V;) and 0. The numbers 
mo,™M1,..., Mrz are called the multiplicities of the scheme. 


Example 30.6. We can explicitly describe the eigenspaces of the 
Hamming scheme H(n, 2). The points of this scheme are the binary 
n-tuples (or words) a of Fi. 

For each a € Bj’, define a vector vz with coordinates indexed by 


the point set by 


v,(b) = (—1)), 


These vectors are orthogonal and are the rows of a Hadamard ma- 
trix; see Fig. 30.1 and Chapter 18. We claim that V; may be taken 
to be the span of all vectors v, as a ranges over the words of weight 
bt = Odes 

We check that each v, is an eigenvector of all association matrices 
A;. Let a have weight ¢. Then 


(v.A;)(b) = S_ va(e)Aj(c, 8) 
pas x (—1)' = (1) SS (—1)(ab+e) 


c:d(b,c)=j e:d(b,c)=j 
(a u) : 4 £L n— 4 
=va(b) >> (-1)%" =va(b) $_(-1) ei ece 
u:wt(u)=7 1=0 


This calculation shows, with FE; the matrix of the orthogonal 
projection onto V;, that 


EEG) 
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since the two sides give the same result when premultiplied by any 
v,. So UC span{F#o,...,£,}. Equality holds because both spaces 
have dimension n + 1. 

The eigenvalues of the schemes H(n, q) for general q are given in 
Theorem 30.1 below. 


ie aes i a es 011 111 
(2 ae Ae Sasi 010. [110 
hab i ay = ok 

ie ie th 2 el, 

Cort) Ss eh! a 

pee ah hel he ea 

Sot SH Sis i 4 —.001_ | 101 
Pata to at Mie et 000 100 

Figure 30.1 


For example, if we order the words of length n = 3 as 
000, 100, 010, 001, 011, 101, 110, 111, 


then the eigenspaces Vo, Vi, V2, V3 of the cube (= A, of H(3,2)) 
shown at the right of Fig. 30.1 are spanned respectively by the first 
row, the next three rows, the next three rows, and the last row of 
the Hadamard matrix at the left of Fig. 30.1. 


Example 30.7. We now describe the eigenspaces of the Johnson 
scheme J(v,k). We omit the proofs; see Delsarte (1973) or Wilson 
(1984b). 

The point set of this scheme is the set X of k-subsets S of a v-set 
X. For each subset T C X of size < k, let er be the vector of 
length (7) in R* where 


Cae bed ee are 


0 otherwise. 


e7(S) := { 


Let U; be the span of {er : T C X,|T| = 7} fori =0,1,...,k. We 
claim that Up CU; C --- CU, = R* and that U; has dimension 
Cle Let Vo := Up (the constant vectors) and for i > 0, let V; be the 
orthogonal complement of U;_, in Uj, i.e. 


VU) Us: 
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It is clear that Vo,Vi,..., Vj; are orthogonal and sum to R*. It can 
be shown that each vector in V; is an eigenvector of A; of value 
P;(i) as displayed in Theorem 30.1(i) below. 


Since we have two bases of the vector space 21, we may consider 
the transformation matrices between them, which are called the 
eigenmatrices of the scheme. Define P (the first eigenmatrix) and 
Q (the second eigenmatrix) as the k+1 x k+1 matrices with rows 
and columns indexed by 0,1,..., such that 


(A A ic CA BB ace 


and 
N (Eo, F,..-, Fx) = (Ap, Al,---, Ag) Q- 


We write P)(z) for the (7,2) entry of P, and similarly, we let Q)(2) 
be the (i, 2) entry of Q, so that 


(30.3) Ac = P,(0) Ey + Pp(1)\Fy +--+: + Prk) Ex, 
and 
(30.4) N E¢ = Qe(0)Ao + Qe(1) Ar + +++ + Qe(k) Ar. 


Of course, we have 
Q=NP", P=NQ. 


The ¢-th column of P consists of the eigenvalues of Ay. 

While it is not so important to know the parameters Pi; of a 
scheme, it is important for applications to know the eigenmatrices 
P and Q. See Bannai and Ito (1984) or Delsarte (1973) for proofs of 
the following theorem and the determination of the eigenmatrices 
of other schemes. 


Theorem 30.1. (i) For the Johnson scheme J(v,k), the degrees 
are ny = () ro) and the multiplicities are mp = e - (es C= 
0,1,...,k. The entries of the first eigenmatrix are P;(i) where 


ae Ene ¢ a G - ") (' = : ie *) 


a=0 
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(ii) For the Hamming scheme H(n, q), the degrees and multiplic- 
ities are ng = mp = (i) (q —1)', £=0,1,...,n. The entries of the 
first eigenmatrix are P;(i) where 


ren Beane o(-2)(0) 


a=0 


Problem 30A. Calculate the eigenmatrix P of J(8,3) in the fol- 
lowing way. First calculate A; A» as a linear combination of Ag, Aj, 
A», and A3, and then fill in the missing line in the following table: 


Arp= Abo, 

Ay Ai, 

A? = 15Ag + 6A; + 4Ap, 

AS = ; 


A} = 1245.Ay + 1036.4; + 888.42 + 720A3. 


From this table, it is only mildly tedious to derive the minimal 
polynomial of A,. Find the eigenvalues of A,;. Now express Ag and 
A3 as polynomials in Aj, and so find their eigenvalues. Find the 
multiplicities. Check your work by verifying the orthogonality rela- 
tions of Theorem 30.2 or just calculating the values from Theorem 
30.1(i). 


Problem 30B. Show how to calculate all the parameters pi, of 
a scheme given the eigenmatrix P. That is, prove that they are 
uniquely determined by P. 


Problem 30C. The Latin square graphs are srg(v, k, X, w)’s where 
for some integers n and r, 


v=n’, k=r(n-1), A=(n—-2)4+(r—-1)(r—-2), w=r(r—-1). 


These were introduced in Example 21.7 for r = 3. Find the eigen- 
matrices P and @ for the 2-class schemes corresponding to these 
graphs. 

The so-called negative Latin square graphs are strongly regular 
graphs srg(v, k, A, 4) whose parameters are obtained from the above 
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by replacing n by —n andr by —r. Sou = (—n)’, k = (—r)(—n-1), 
etc. (It is strange that this should yield parameters satisfying 
(21.4), but it does.) Find the eigenmatrices P and Q for the corre- 
sponding 2-class schemes. 


Theorem 30.2. The eigenmatrices of a scheme satisfy the orthog- 
onality relations 


il). *) 0 1 0 0 
+ 0 my 0 0 ny 0 
P ‘ Aes P=N 
0 0 ++ mp 00 + ne 
and 
10 0 1 0 0 
T 0 ny 0 0 my 0 
Q - 4 2 | Q=N]. 
00 + np 0 0 ++ mp 


PROOF: The vector space 2 can be equipped with an inner product 
in a more-or-less natural way: We define (A, B) to be the sum of 
the entries of the Hadamard product Ao B. Check that this is the 
same as the trace of the matrix product AB' (which is the same 
as tr(AB) when B is symmetric, of course). 

With respect to this inner product, the basis Ap, A1,..., Ay is an 
orthogonal basis (but not orthonormal); we have (A;, A;) = N nj. 
But Eo, £i,...,E, is also an orthogonal basis since E;E; = 0 for 
i # j; we have (Fj, E;) = tr(E;) = m;. The derivation of the 
theorem from this is now elementary linear algebra. 

Consider the first relation. The entry in the (a, () position on 
the right is 


(Aq, Ag) = (2, Palt) BD Pal) (j)E; 
-EA0 (i) P3(J) (Ei, E;) a al 


and this last expression is the entry in the (a,(@) position on the 
left. The second relation is derived similarly. 
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Another way to express the content of the above theorem is 


10 - O cl 1: Oi Lvs 46) 
Q = st : : pt ake : 
00 + ne 0 0 ++ mp 
Equivalently, 
(30.5) mi P,(j) = niQ;(@) 


forall 429 = Uy lara 

With Theorems 30.1 and 30.2, we can find the second eigenmatrix 
Q for the schemes J(v,k) and H(n,q). It is somewhat surprising 
that P = Q for H(n,q); the reader should check this. 


Problem 30D. Explain why the zeroth row and column of P and 
Q are as indicated below. 


Ly ee lm... Mrz 
1 1 

(30.6) P=|. , Q=|. 
a 1 


Delsarte (1973) observed that the columns of the second eigenma- 
trix Q provide a system of linear constraints, which we will call Del- 
sarte’s inequalities, on, what he calls, the ‘inner distribution vector’ 
of a nonempty subset Y of the point set X of an association scheme. 


We define the distribution vector of Y to be a = (ao, a@1,..., Gx) 
where l 
a; = —|(Y x Y)N Ril; 
[Y| 


that is, a; is the average number of 7-th associates y € Y of an 
element x € Y. We have 


ao = 1, ag +a, +---+ap=|Y]. 
For many interesting subsets Y, the number of i-th associates 


y € Y of an element x € Y is constant, i.e. it does not depend on 
which x in Y is chosen. For example, this is true if C is a linear 
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code in H(n, q) in which case the distribution vector coincides with 
what we called the weight enumerator of C’ in Chapter 21, and 
also for other beautiful configurations which have been discussed 
in previous chapters and whose distribution vectors we list below. 
Here Hamming(7) is the Hamming code of length 7, ‘Golay’ is short 
for ‘Golay code’, ‘X’ means ‘extended’, and a ‘*’ denotes ‘dual code’. 


Object Scheme Distribution vector 
S(2y3, 0) J(7,3) — (1,0,6,0) 
S'(2, 5, 11) J(11,5) (1,0,0,10,0,0) 
So(3, 6, 12) J(12,6) (1,0,0,20,0,0,1) 
5 (5, 6;.12) J(12,6) (1,0,45,40,45,0,1) 
S(A, 7, 23) J(23,7) (1,0,0,0,140,0,112,0) 
(5, 8, 24) J(24,8) (1,0,0,0,280,0,448,0,30) 
Hamming(7) H(7,2)  (1,0,0,7,7,0,0,1) 
X-Hamming(7) H(8,2) (1,0,0,0,14,0,0,0,1) 
Binary Golay  H(23,2) (1,0,0,0,0,0,0,253,506,0,0,1288, ...) 
X-Binary Golay H(24,2) (1,0,0,0,0,0,0,0,759,0,0,0,2576,0, ...) 
Ternary Golay #H(11,3) (1,0,0,0,0,132,132,0,330,110,0,24) 


*-Ternary Golay H(11,3) (1,0,0,0,0,0,132,0,0,110,0,0) 


Theorem 30.3. The distribution vector a of a nonempty subset 
of an association scheme satisfies 


aQ) > 0 
where 0 is the row vector of k + 1 zeros. 


PROOF: Let ¢ € R* be the characteristic vector of Y. That is, 


1 ifvey, 
oe) = { 0 ifady. 
Then 
Q= peal 
[Y| 


Since E, is idempotent and symmetric, 


0 < ||\bE¢|? = (¢Ex)(¢Ex)' = End! 
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We note for further reference that the ¢-th inequality holds with 
equality if and only if the projection @Ey is the zero vector. The 
zeroth inequality is trivial since the zeroth coordinate of a@ is ag + 
a, +---+ ax, which of course is nonnegative. 


Example 30.8. For J(8,3), Delsarte’s inequalities when fractions 
are cleared are 


15 + 7a, — a2 — Yaz > 0, 
30 + 2a; — 5a2 + 9a3 > 0, 
10 — 2a; + ag — az > 0. 


These can be viewed as necessary conditions for the existence of a 
family F of 3-subsets of an 8-set with a given distribution vector 
tl, a1, 42, az). 

We can ask what they imply about |F| when we assume some of 
the a;’s are zero. For example, assume a3 = 0 (no two members of 
F are disjoint). The maximum value of 1 + a; + a2 (= |F]|) subject 
to these inequalities is 21; this is a linear programming problem. 
We already knew |F| < 21 because of the Erdés-Ko-Rado theorem, 
Theorem 6.4. Theorem 30.3 implies Theorem 6.4; see the notes. 


Theorem 30.3 leads to a linear programming bound on the car- 
dinalities of codes. For the Hamming scheme H(n,q), given an 
integer d, we may consider the linear programming problem 


“maximize 1+ ag+dq41 +--+: +a, subject to a; > 0 and 
GI 6 repre | Pare ers Se emer mm aed Be 


where Q is the n+ 1 by n+ 1 second eigenmatrix of the scheme. If 
LPB denotes the maximum value of 1 + ag + aqg41 +--+: +a, under 
these conditions and C’ is any q-ary code of length n with minimum 
distance at least d, then |C| < LPB since the distribution vector a 
of C' satisfies the conditions and the coordinates sum to |C|. 

We mention that for the Hamming schemes, if a is the distri- 
bution vector of a linear code C, then qaQ is the distribution 
vector of the dual code C+. To see this, just compare the formulae 
in MacWilliams’ theorem, Theorem 20.3, with those in Theorem 
30.1(ii). This certainly explains why aQ > 0 for the distribution 
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vector a of a linear code, but we have proved above that it holds 
even if C' is not linear. 


n d SPB LPBiapajaz ag a4 a5 a6 7 dg a9 G40 Gn 42413014015 
11 3 170.7 170.71 0 0 18.3 36.7 29.3 29.3 36.7183 0 O 1 
115 306 24/100 0 OO 11 11 0 0 0 0 1 
117 88 4/100 0 0 0 0 2 1 0 0 0 
12 3 315.1 292.6100 20 45 48 56 65.1 40.711.4 3.4 1.70.1 
125 519 40/100 0 O 15 175 0 0 5 15 0 0 
127 13.7 53/100 0 0 O O 2.7 1.7 0 0 0 0 
13 3 585.1 512)/100 22 55 72 96 116 87 40 16 6 1 
135 89.0 64]100 0 O 18 24 4 3 10 4 0 O 
0 
7 
0 


137 21.7 8 100 0 0 0 0 4 8 0 0 0 
14 31092.31024)100 28 77 112 168 232 203 112 56 28 
145 1546 128)100 0 O 28 42 8 7 28 14 O 
147 349 16/100 0 0 0 0 8 7 0 0 0 O 
153 2048 2048)100 35 105 168 280 435 435 280 168 105 35 
155 270.8 256/100 0 O 42 70 15 15 70 42 0 O 
157 569 382/100 0 0O O O 15 15 0 0 0 0 


oo o;}o o0o;}o Co O&O 


oOo OoO;Co CO oO 


Figure 30.2 


It is difficult to extract useful information in general from the 
linear programming bound (LPB), but it can be shown that it is al- 
ways at least as good as the sphere packing bound (SPB), Theorem 
21.1; see Delsarte (1973). For small values of the parameters, one 
can use the simplex algorithm and expicitly evaluate this bound. 
We have done so for some values of n and d in the case of binary 
codes and display the results in the table of Fig. 30.2. The table 
gives a nonnegative vector (1,0,...,0,@a, @a+1,---,;@n) whose sum 
is maximum subject to aQ > 0. It is a good exercise for the reader 
to try to decide whether codes meeting the LPB as in the table 
actually exist. We mention that several can be obtained from an 
interesting binary code of length 16 called the Nordstrom-Robinson 
code, which has 256 codewords and minimum distance 6. It is not 
linear. See problems 30I and 30J. 

There appears to be a kind of ‘duality’ between the two bases of, 
and between ordinary and Hadamard multiplication on, the Bose— 
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Mesner algebra. It sometimes happens that there exists a second 
association scheme so that the £;’s of the latter under Hadamard 
multiplication act like the A;’s of the former under ordinary multi- 
plication (apart from a scalar factor), and vice versa. Two schemes 
are said to be formally dual when the first eigenmatrix P of one is 
equal to the second eigenmatrix Q of the other (cf. Problem 30E 
below). Many examples of pairs of formally dual strongly regular 
graphs are known. The Hamming scheme, as mentioned above, is 
formally dual to itself. The Johnson scheme usually has no formal 
dual, because in general the Krein parameters defined below are 
not integers. 

The Krein parameters of an association scheme are the (k + 1)? 
numbers qi; defined by 


k 
(30.7) NE,oE;=)_ q,Et. 
=0 


If the scheme has a formal dual, these are the parameters pi, for 
that formally dual scheme and hence are nonnegative integers. We 
always have 

@ 20. for all 0 Sag 2 Sk 


because the qi,’ are eigenvalues of the Hadamard product of two 
positive semidefinite matrices and are hence nonnegative; cf. Prob- 
lem 21E. In principle, the Krein parameters are functions of the 
original parameters pi, (see Problem 30E) and their nonnegativity 
can be viewed as a necessary condition for the existence of a scheme 
with given parameters; we did this for strongly regular graphs in 
Theorem 21.3. 


Problem 30E. Show how to calculate all the parameters qi; of 
a scheme given the eigenmatrix @. That is, prove that they are 
uniquely determined by Q. 


We need to know later that 
(30.8) di=m and qj, =0 fori fj. 


This follows from (30.7) when we consider the sum of all entries of 
the matrices on both sides; the sum of the entries of the left-hand 
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side is N times 


m, ifi=j, 
ely { O otherwise. 

For some applications to codes and designs, we need to know 
more about a scheme. However, here is one interesting result we 
can prove for general schemes before we specialize. 

Given a k-class scheme and a subset K of {1,2,...,k}, a subset 
Y of the point set is called a K -clique when for every pair of distinct 
elements x,y € Y, x and y are j-th associates for some 7 € K. The 
subset Y is a K-coclique when no pair of elements z7,y € Y are 
j-th associates for any 7 € K. If G is the graph whose adjacency 
matrix is )> ick Aj, then cliques in G are the same as /-cliques 
in the scheme, and cocliques (independent sets of vertices) are the 
same as K-cocliques in the scheme. 


Theorem 30.4. Let A C X bea K-coclique and B C X a K-clique 


in a k-class association scheme on a set X where K C {1,2,...,k}. 
Then 

|A||B] < N. 
Proor: Let a = (ao,a@1,...,a,) be the distribution vector of A 


and let b = (bo, b1,...,b%) be the distribution vector of B. Take 
the inverse of the first equation of Theorem 30.2, premultiply by a 
and postmultiply by b!' to get 


10 - 0 —l 10 - 0 -1 
0m 0 0 ny 0 

(30.9) aQ]. . . | (bQ)'=Na lf, Be 
0 0 -+ mr 00 - nr 


By Theorem 30.3, both aQ and bQ are nonnegative vectors. Their 
zeroth coordinates are |A| and |B], respectively, so the scalar on 
the left of (30.9) is at least |A||B|. Our hypothesis implies that 
a,b; = 0 for 7 > 0, so the scalar on the right of (30.9) is N. 

An example of equality in Theorem 30.4 in J(v,k) occurs when 
there exists an S(t,k,v). Take K = {1,2,...,k —t}. Then the 
block set of the S(t,k,v) is a K-coclique. The set of all k-subsets 
containing a fixed t-subset is a K-clique. An example of equality 
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in Theorem 30.4 in H(n,q) occurs when there exists a perfect e- 
error-correcting code C. Take K = {1,2,...,2e}. Then C isa 
K-coclique. A sphere of radius e about a fixed word is a K-clique. 


Problem 30F. Find an example of a regular graph G, a clique A, 
and a coclique B such that |A||B| > |G]. 


Problem 30G. Prove that if A is a clique and B a coclique in a 
graph G which admits a transitive group of automorphisms, then 
|A] |B] < |G. 


We now define polynomial schemes, of which there are two types. 
An association scheme (with a particular numbering A), Ao,..., Ax 
of its association matrices) is said to be P-polynomial when A; 
is a polynomial of degree 7 in A; for i = 0,1,2,...,k4. An asso- 
ciation scheme (with a particular numbering F}, F2,..., E, of its 
idempotents) is said to be Q-polynomial when E; is a Hadamard 
polynomial of degree i in FE, for i = 0,1,2,...,&4 (we mean that 
there is a polynomial of degree 7 so that EF; results by applying the 
polynomial element-wise to, i.e. to each entry of, FE). 


Problem 30H. Prove that an association scheme is P-polynomial 
if and only if it is a metric scheme. 


The @-polynomial schemes are also called cometric, but there 
seems to no simple geometric interpretation of cometric. We use 
the terms metric and cometric rather than P- and Q-polynomial. 

It can be seen that the Hamming and Johnson schemes are co- 
metric; see Delsarte (1973). 

We define a d-code in a metric scheme to be a subset S C X 
whose characteristic vector @ satisfies 


¢A@' =0 fori=1,2,...,d—1. 


We define a t-design in a cometric scheme to be a subset S C X 
whose characteristic vector @ satisfies 


¢E:o' =0 fori=1,2,...,t. 


The combinatorial significance of d-codes is straightforward: S 
is a d-code if and only if no two distinct elements of S are i-th 
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associates for i < d. So in the Hamming scheme, S is a (2e + 1)- 
code if and only if it is e-error correcting, i.e. has minimum distance 
at least 2e + 1. The combinatorial significance of t-designs is not 
so clear and can only be understood with more knowledge of the 
particular scheme. Note that S is a t-design if and only if the first 
t nontrivial inequalities on the distribution vector of S in Theorem 
30.3 hold with equality, i.e. that ¢E; = 0 for 7 =1,...,#. 

The following theorem explains that designs in the Johnson and 
Hamming schemes correspond to the classical concepts of t-designs 
and orthogonal arrays, respectively. An orthogonal array of index 
A and strength t on a set A of gq symbols is a subset C C A” of 
Aq words so that for every choice of t of the n coordinates, each 
possible t-tuple of elements of A occurs exactly times in those t 
coordinates among the members of C. 


Theorem 30.5. 

(i) A family S' of k-subsets of a v-set is a t-design when considered 
as a subset of J(v, k) if and only if it is the set of blocks of a t-design 
in the classical sense. 

(ii) A family S of n-tuples from an alphabet A of q elements is a 
t-design when considered as a subset of H(n,q) if and only if it is 
the set of columns of an orthogonal array of strength t. 


PARTIAL PROOF: We will show that the definition of t-design in 
the schemes implies that the family S is a classical t-design or 
orthogonal array and leave the converses of both parts (i) and (ii) 
for the reader. 

Let S' be a t-design in J(v,k) in the sense of association schemes 
and ¢ its characteristic vector. In Example 30.7, we saw that Ey + 
FE, +---+ E;, is the orthogonal projection onto the space spanned 
by er as T ranges over the t-subsets of the v-set. The number of 
members of S containing a t-subset T' is 


der = ¢(Eg +E, +---+ Ever = oper. 


Since Eo is a scalar multiple of the all-one matrix J, this number 
is independent of the particular t-subset T’. 

Let S be a t-design in H(n,q) in the sense of association schemes 
and ¢ its characteristic vector. In Example 30.6, we described the 
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eigenspaces for gq = 2, and we prove the theorem only in this case. 
A complete proof may be found in Delsarte (1973). 
With the notation of Example 30.6, 


(~, va) = D(-1)@", 


beS' 


so S is a t-design if and only if 


So(-pia” =0 for all nonzero a of weight < t. 
bes 


For example, when a is taken of weight 1, the above equation 
implies that in any given coordinate position, half the members of 
S have entry 0 and half have entry 1. When a is taken of weight 
2, e.g. as (1,1,0,0,...,0), the equation implies that the number of 
members of S that begin with 00 or 11 is equal to the number that 
begin with 10 or 01; it follows from this and the previous sentence 
that exactly 1/4 of the n-tuples in S begin with each of 00, 10, 01, 
11. 

In general, consider a t-subset T C {1,2,...,} of coordinate 
positions. For each subset I of T, let A; denote the number of 
a € § with entry 1 in coordinates of J and entry 0 in coordinates of 
T \I. For each subset I of T, let ay denote the binary n-tuple with 
1’s in coordinates of J and 0’s in the remaining n— |.J| coordinates. 
We have 2° linear equations, one for each J C T: 


|S| if J=0 
AI — Ag: = 
py : py ; { O otherwise. 


|INJ|=0 (mod 2) |INJ|=1 (mod 2) 


Clearly A; = |S|/2' for all J C T is a solution of this system. But 
the coefficient matrix of this system is a Hadamard matrix of order 
2' (see Chapter 18) and in particular is nonsingular, so the solution 
is unique. Thus S is an orthogonal array of strength t. 


The next two theorems are ‘formal duals’. That is, their proofs 
are similar but the roles of the bases of A;’s and E;’s are inter- 
changed, as is the role of ordinary and Hadamard multiplication 
of matrices. For the Hamming schemes, Theorem 30.6(i) reduces 
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to the sphere packing bound, Theorem 21.1, and part (iii) gives a 
very strong condition, due to S. P. Lloyd (1957), for equality to hold 
(perfect codes—see Chapter 20). For the Johnson schemes, Theorem 
30.7(i) reduces to Theorem 19.8, and part (iii) gives a very strong 
condition, due to Ray-Chaudhuri and Wilson (1975), for equality 
to hold (tight designs-see Chapter 19). 


Theorem 30.6. Let C be a (2e+1)-code in a k-class metric asso- 


ciation scheme on a set X. Let ¢ € R* be the characteristic vector 
of C. 
(i) We have 


IC| < N/U+n1+ ne+-+++ Ne). 
(ii) There are at least e indices i € {1,2,...,k} such that 
bE! £0. 


(iii) Equality holds in (i) if and only if equality holds in (ii), in 
which case the e indices i such that ¢E;¢' 4 0 are exactly those 
integers 1 for which 

d_ Pili) = 
=0 


ProoF: Let ¢ be the characteristic vector of a (2e+1)-code C and 
consider the expression 


a := b(cpAg + 1A +--+ + Ae)*o! 


where Co, C1,.--,Ce are scalars. We shall evaluate a in two ways. 
Introduce 


f (4) := ep Po(i) + cy P(t) + - +> + oP. (2) 


as a function of the c;’s. By (30.3), coA9 + crA1 +--- + ¢-Ae = 
ee ofl Ei SO 


o= (Sv wes) (1109 on) = 50) °OEie". 
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On the other hand, A; is a polynomial of degree 7 in A;, so (Ap + 
A, +-+++A,)? is a polynomial of degree 2e in A, and hence a 
linear combination of Ag, A;,...,A2-. Our hypothesis implies that 
~A;b' = 0 for i = 1,2,...,2e. Thus to evaluate a, we need only 
the coefficient of Ag when (Ag +Ai+--:+ A)? is written as a linear 
combination of Ap, Ay,..., Aa-; by (30.2) and (30.1), 


a= (> ve bAod' = bs en) |C|. 
i=0 


i,j=0 


Note that by (30.6), f(0) = congo + any +--+ + cere. We now 
combine the two values for a, remembering that Ey = x so that 
ood! = %\C|’, to obtain 


(30.10) (ce+cny+---+en,)|C| = y4) )>oE.d" 


= iy, + en, +++ + cen)? |CP?. 
N 

Everything will follow from (30.10). Part (i) follows when we 
take all c; := 1. 

To prove part (ii), suppose for contradiction that there are fewer 
than e indices i > 1 such that ¢E;¢' 4 0. By elementary linear 
algebra, there exist scalars co,..., Ce, not all zero, such that f(i) = 0 
for i = 0 and all i such that dE;¢' # 0. But then (30.10) gives 
IC] 3o¥_ nic? = 0, a contradiction. 

Assume equality holds in (i). Then (30.10), with all c¢;’s equal 
to 1, shows that f(i) = >°/_) Pe(i) = 0 for the e or more values of 
i for which ¢E;¢' 4 0. We claim that f(i) = 0 cannot hold for 
more than e values of i. This is because f(0), f(1),..., f(&) are the 
eigenvalues of Ag + A; +---+ Ae, which is a polynomial of degree 
e in Aj; that is, f(0),..., f(k) arise from evaluating a polynomial 
of degree e on the eigenvalues P,(0),...,Pi(k) of A;. (The matrix 
A, has k + 1 distinct eigenvalues since it generates an algebra of 
dimension k+1.) Our claim follows from the fact that a polynomial 
of degree e has at most e zeros. So (ii) holds with equality and (iii) 
is proved in this case. 
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Finally, assume equality holds in (ii). Choose scalars co,..., Ce 
so that f(i) = 0 for all i such that ¢E;¢' 4 O and such that 
f(0) =1. Then (30.10) yields |C| = N(c3 + cin, +---+c?n,.). The 
Cauchy-Schwartz inequality shows that 


“(oJ Ee) 


so |C|] > N/(L+n,+no+---+n,). Thus equality holds in (i). 


Corollary (Lloyd’s theorem). Ifa perfect e-error-correcting code 
of length n over an alphabet of size q exits, then 


has e distinct integral zeros. 


Proor: This is just Theorem 30.6(iii) stated for the Hamming 
scheme. The sum }7/_, Pe(a) where P)(x) is as in Theorem 30.1(ii) 
simplifies to L.(x) as stated above. 


Example 30.9. We show that there are no nontrivial perfect 2- 
error-correcting binary codes. If such a code has length n > 2, then 
its cardinality is 2"/(1+n+(5)), so l+n+(5) = 2” for some integer 
r. (That n has this property is unusual—but it may happen, e.g. 
for n = 90.) From Theorem 30.1(ii), for H(n, 2), 


Po(x) + Py(x) + Pa(x) = 2a* — 2(n + 1)e+1l+n+ (5) 


and Theorem 30.6(iii) then asserts that 
x? —(n+1l)e+2"!'=0 


has two integral roots 2; and x2. We must have x; = 2% and x2 = 2? 
for positive integers a and b where a+b = r—1 and 27+2° = n+1. 
Then it may be easily verified that 


QO ate pbtl1 —! 1)? as gatb+4 ea 
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If a and 6 are both > 2, then the left-hand side is = 1 (mod 16) 
while the right-hand side is = 9 (mod 16), a contradiction. The 
reader may consider other cases and will find that the only possi- 
bility is {a,b} = {1,2} in which case n = 5. 


Theorem 30.7. Let D be a 2s-design in a k-class cometric asso- 
ciation scheme on a set X. Let ¢ € R* be the characteristic vector 
of D. 

(i) We have 

|D) > 1L+m,+m2+---+ms. 
(ii) There are at least s indices i € {1,2,...,k} such that 
oAid! £0. 
(iii) Equality holds in (i) if and only if equality holds in (ii), in 


which case the s indices i such that ¢6A;¢' 4 0 are exactly those 
integers 1 for which 


3 Qe(i) = 0. 
(=0 


PRooF: Let @ be the characteristic vector of a 2s-design D and 
consider the expression 


B= (coy +f +++ + css) 0 (eopfp taf te+- + csE.)o! 


where co, C1,.--,Cs are scalars. We shall evaluate @ in two ways. 
Introduce 


g(i) = <(coQali) + rQi(i) +--+ 6,Q.(@) 


as a function of the c;’s. By (30.4), coop +t ak, +---+¢,F, = 
Sg g(t) Ai sO 


k k a k 
p= ( aoe4,) 0 (>: avo.) = g(t)". 
i=0 i 
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On the other hand, FE; is a Hadamard polynomial of degree 7 in 
FE, so the Hadamard square of Ey + FE, +---+ EF, is a Hadamard 
polynomial of degree 2s in E; and hence a linear combination of 
Eo, F\,...,F;. Our hypothesis implies that dE;¢' = 0 for i = 
1,2,...,2s. Thus to evaluate @, we need only the coefficient of Eo 
when the Hadamard square of fy + FE, +---+ E, is written as a 
linear combination of Eo, E1,..., £25; by (30.7) and (30.8), 


1 s 1 s 
= € > cea) bE! = N2 (> an) |e. 
i,j=0 i=0 


Note that by (30.6), g(0) = x (como +cym, +---+c¢,m,). We 
now combine the two values for 3 to obtain 


(30.11) t+ cm +---+em,)|Cl? = Yai oA! 


1 
ne! 


> —(co tems t---+ csms)?|C]. 


alee 


Everything will follow from (30.11) in a manner similar to that 
of the proof of Theorem 30.6. 


Problem 30I. Consider the extended Golay code G4. For no- 
tational convenience, arrange the coordinates so that the word 
c = (1,...,1,0...,0) with 1’s in the first eight positions, and 0’s 
elsewhere, is a codeword. 

(i) Use counting arguments to show that G24 contains 30 words 
of weight 8 that have no 1’s in the first eight positions (so: distance 
16 to c). Then show that all the words in Gy, that have no 1’s in 
the first eight positions form a subcode that is essentially R(1, 4). 

(ii) Let A be the set of 16-tuples (a9, ai0,...,@24) obtained by 
dropping the first eight coordinates from words a = (a1, d@9,..., a1) 
in Go, such that the weight of (a), a2,...,a7) is < 1. Show that A 
is a code of length 16 with 256 codewords and minimum distance 
6. 

(iii) Use A to construct some examples of codes where equality 
holds in Figure 30.2. 
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Problem 30J. Show that a binary linear code with parameters 
[13,6,5] does not exist. (And hence there are no binary linear 
codes with parameters [16, 8, 6] either.) 


Problem 30K. The odd graph has as vertices the k-subsets of a 
(2k + 1)-set, joined by an edge if they are disjoint. (For k = 2 this 
is the Petersen graph.) 

(i) Show that this graph is distance regular. 

(ii) Show that if k = 3, the vertices ‘corresponding to’ the Fano 
plane form a perfect code in the graph. 


Problem 30L. Find all examples of perfect 3-codes in the Johnson 
scheme J(v,3). (All the examples are trivial.) 


Notes. 

Association schemes were introduced by statisticians at the same 
time as partially balanced designs. An incidence structure is par- 
tially balanced with respect to an association scheme on its points 
when each pair of i-th associates occurs in a constant number A; 
of blocks. Examples include the partial geometries of Chapter 21. 
Partially balanced designs were introduced to get around Fisher’s 
inequality b > v, or equivalently r > k, which need not hold for 
them. Replications cost money (r stands for ‘replication number’) 
or take time, so for practical reasons, one wants r small. 

To do the analysis of an experiment for which a certain design 
(incidence structure) has been used, it is necessary to do calcula- 
tions with NN' where N is the incidence matrix; these calculations 
are vastly simplified for partially balanced designs because in an m- 
class scheme 


NN! = AoAo + MA + res + Ang Arn 


lies in the small-dimensional Bose—Mesner algebra even though the 
size of the matrices may be very much more than the dimension 
m+1. 

Delsarte’s inequalities imply the general Erdds-Ko-Rado theo- 
rem: If n > (+ 1)(k —t+1) and F is a family of k-subsets of an 
n-set so that any two of members of F meet in at least t points, 


then |F| < (71). See Wilson (1984a). 


380. Association schemes 431 


See Van Lint (1999) for more on the nonexistence of perfect 
codes. 
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(More) algebraic techniques 
in graph theory 


We have used linear algebraic techniques on the adjacency matrices 
of graphs in Chapter 9, and extensively in Chapter 21. We collect 
here several other elegant applications. Also see Chapter 36. 

A tournament is an orientation of a complete graph; that is, a 
directed graph such that for any two distinct vertices x and y, there 
is either an edge from x to y, or an edge from y to x, but not both. 
Tournaments were introduced briefly in Problem 3D. The adjacency 
matrix of a digraph has a 1 in position (x,y) when there is an edge 
from x to y, and 0 otherwise. 


Lemma 31.1. The rank of the adjacency matrix A of a tourna- 
ment on n vertices is either n or n — 1. 


Proor: The definition of tournament ensures that A+A! = J—I, 
where all matrices are n x n. Suppose the rank of A is at most 
n — 2. Then there exists a nonzero row vector x such that both 
xA =0 and xJ = 0. We then compute 


0=x(A+A")x’ =x(J—I)x'’ =—xx' <0, 


which contradicts the existence of x. 

The following theorem is due to R. L. Graham and H. O. Pollak. 
Their original proof applied Sylvester’s law, which they had also 
used for their proof of Theorem 9.1. 


Theorem 31.2. Suppose that the complete graph K,, can be ex- 
pressed as the union of k edge-disjoint subgraphs H,, Ho,..., Hr 
where each H; is a complete bipartite graph. Then k > n—1. 
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PROOF: Orient each complete bipartite subgraph H; by directing 
all edges from one of the two color classes to the other. This pro- 
duces a tournament on n vertices whose adjacency matrix A is the 
sum of the k adjacency matrices A; of the digraphs H; (augmented 
to n xX n matrices by including all vertices). The adjacency ma- 
trix of such a complete directed bipartite subgraph, after a suitable 
renumbering of the vertices, has the block form 


Ooh. 
O O O 
O O O 


and in particular has rank 1. This implies that the rank of A is at 
most k, and Lemma 31.1 completes the proof. 


This brings to mind the De Bruijn—Erdés theorem, Theorem 19.1, 
which asserts, in a quite different terminology, and which was given 
a quite different proof, that if K,, is the union of k edge-disjoint 
complete subgraphs, then k > n. But no proof of Theorem 31.2 is 
known which does not use linear algebra. 

As a real symmetric matrix, the adjacency matrix A = A(G) of 
a finite graph G has an orthogonal basis of eigenvectors. (Eigen- 
vectors corresponding to different eigenvalues are necessarily or- 
thogonal.) By the ‘eigenvalues of G’ we mean the eigenvalues of 
A(G). Here is a short table of the spectra, i.e. the complete lists of 
eigenvalues, of several graphs: 


graph spectrum 

Ks 4,—1,-1,-1,-1 

K33 3, 0, 0, 0,0, —3 

Cube 3, 1,1,1, -1, -1,-1,-3 
Pentagon 2,4(-1+ V5), 3$(-1+ V5), 


4-1 — v8), -1 - vB) 
Petersen graph | 3,1,1,1,1,1,-2,—2,-—2,-2 
L2(3) 4,1,1,1,1, -2, -2, -2,-2 
Heawood graph | 3, V2, V2, V2, V2, V2, v2, 

-V2, -V2, -V2, -vV2, -vV2, -V2, -3 
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(The Heawood graph is the ‘bipartite incidence graph’ of the 
Fano configuration, with seven vertices representing the points and 
seven others representing the lines. It appears again in Chapter 
35.) 

Nonisomorphic graphs may have the same spectra. For example, 
we mentioned in Chapter 21 that there exist four nonisomorphic 
strongly regular graphs with the parameters of T(8); cf. Theorem 
21.5. 


Problem 31A. Show that it is not possible to find three edge- 
disjoint copies of the Petersen graph in Kyo. Hint: if A,, Ao, and 
A3 are the adjacency matrices of three edge-disjoint Petersen sub- 
graphs of Kyo, then all three matrices have the same spectrum, all 
have the all-one vector j as an eigenvector, and A;+ A+ A3 = J—I. 

Let S be asymmetric matrix. The expression xSx!/xx!, for x 4 
0, is called a Raleigh quotient. Let e;,e2,...,e, be an orthonormal 
basis of eigenvectors with corresponding eigenvalues 

At = Az 2 ale > dips 

If we write x in this basis, say x = aye, +--- + anen, then 
xSx' — Ayaj + Agas +--+ + Ana? 


31.1 
ae xx! a? +a3+---+a2 


In particular, for any nonzero x, 


a 
2 Mis 


xx! 

While most graphs we will consider are simple, we remark that 
there is nothing to prevent the consideration of adjacency matrices 
of multigraphs below (where the entry in row x and column y is 
the number of edges joining x and y), or the adjacency matrices 
of ‘weighted’ graphs (where the entry in row x and column y is a 
‘weight’ associated to the edge joining x and y). 

It is at first surprising that the spectrum of a graph has anything 
at all to do with the more geometric or combinatorial properties of 
the graph. One of the first connections observed was the following 
theorem of A. J. Hoffman. A coclique, or an independent set of 
vertices, in a graph G is a set of vertices no two of which are 
adjacent. 
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Theorem 31.3. Let G be a graph on n vertices which is regular 
of degree d and let Amin be the least eigenvalue of G, so Amin is 
negative. Then for any coclique S in G, 


Sl< min ; 
| | _ d _ Awmin 


PROOF: Let A be the adjacency matrix of G and XA := Amin. Then 
A-— XI is positive semidefinite, i.e. all eigenvalues are nonnegative; 
note that the eigenvectors of A — AJ are the same as those of A. 
One of the eigenvectors is j := (1,1,...,1), and we have 


j(A- 1) = (d— dj. 


Then with 


we will have jM = 0. Every other eigenvector e of A may be 
taken to be orthogonal to j, so eJ = 0, and is seen also to be an 
eigenvector of MM with nonnegative eigenvalue. That is, M is also 
positive semidefinite. 

Now let ¢ be the characteristic vector of a coclique S' consisting 
of m vertices of G, ie. o(x) is 1 if x € S and 0 otherwise. Then 
~Ad! = 0 and we have 


0<¢Md' =—)ddd! — ao ag" = —\m— An? 


The stated inequality follows. 


There are extensions and variations on Theorem 31.3 for nonreg- 
ular graphs—see Haemers (1979). 


Problem 31B. Apply Theorem 31.3 to the complement of a sim- 
ple regular graph G to obtain an upper bound on the size of a clique 
in G in terms of the spectrum. Find several examples where your 
bound is met. 


Problem 31C. The line graph L = L(G) of a simple graph G 
was introduced in Chapter 17. For example, L2(m) as defined in 
Chapter 21 is the line graph of Kim, T'(m) is the line graph of Kin, 
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and the line graph of a pentagon is a pentagon. (i) Show that the 
Petersen graph is not the line graph of any graph. (ii) Prove that 
if G has more edges than vertices, then the minimum eigenvalue 
of the line graph L(G) is —2. (Suggestion: let N be the incidence 
matrix of G and consider NN.) 


L. Lovasz (1979) observed that the bound on the size of a coclique 
in Theorem 31.3 is also a bound on what is called the Shannon 
capacity of a graph. This is a concept that arose in information 
theory. Suppose a set of ‘letters’ is to be used for transmitting 
messages. Some pairs of letters are assumed to be ‘confusable’ or 
‘confoundable’. We say two potential messages (words or strings 
of n of these letters) are confoundable when in each coordinate the 
letters are either identical or confoundable. We desire a set of words 
no two of which are confoundable. 

To state this in more graph-theoretic terminology, we introduce 
the graph G whose vertices are the letters and where two vertices 
are adjacent if and only if the letters are confoundable. Thus a set 
of letters, no two of which are confoundable, is exactly a coclique in 
this graph. The strong product G® H of simple graphs G and H is 
the simple graph defined by V(G @ H) := V(G) x V(#) and where 
distinct vertices (x1, y1) and (22, y2) are adjacent in G@® H when 21 
and x2 are equal or adjacent, and y; and yo are equal or adjacent. 
The vertices of G’ := G®G®---®G (n factors) correspond to 
words of length n, and two are adjacent in G” just when the words 
are confoundable. Thus we are interested in the size of a largest 
coclique in G”. 

The independence number a(G) of a graph G is the largest car- 
dinality of a coclique in G. The Shannon capacity of a graph G 
is 

@(G) := lim (a(G"))"" = sup(a(G"))"". 
That the limit exists and is equal to the supremum follows from the 
observation that the product of cocliques in G and H is a coclique 
in G @ H, and thus that a(G*t™”) > a(G*)a(G™). (See Fekete’s 
lemma in Chapter 11.) 


Example 31.1. The problem of evaluating the Shannon capacity 
of such simple graphs as the pentagon P; was open for many years 
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before the relevance of algebraic techniques was noticed by Lovasz. 
It is easy to find 2” words of length n, no two of which are con- 
foundable, e.g. all words of 1s and 3s. This shows O(P;) > 2. This 
is not so good: we get only four words of length 2 this way, when 
it is not hard to find a set of five: 


(1,15 (23) .(8,9)5 (4,2), 5,4). 


For even values of n, we can take the product of n/2 copies of the 
above five words to get (/5)" pairwise nonconfoundable words of 
length n; this shows O(P;) > 5. Can we do better, perhaps for 
very large values of n? No, as we shall see after Theorem 31.6, 


Q(P5) < V5. 


Here is the approach of Lovasz (1979). First, observe that with 
the definition of Kronecker product of Chapter 18, we have, for 
vectors x,y € R”, v, we R", 


(31.2) (x ov)(yow)! = (x,y)(v,w). 


Let G be a graph. For simplicity we shall always assume that its 
vertices are 1,2,...,n. An orthonormal representation of G is a 
system (vj,...,V,) of unit vectors in a Euclidean space such that 
if ¢ and 7 are nonadjacent vertices, then v; and v; are orthogo- 
nal. Clearly, every graph has an orthonormal representation, for 
example by pairwise orthogonal vectors. 


Lemma 31.4. Let (uj,...,u,) and (vj,...,Vm) be orthonormal 
representations of G and H respectively. Then the vectors uj; ° v; 
form an orthonormal representation of G ® H. 


PRrOoF.: This is a consequence of (31.2). 


Define the value of an orthonormal representation (u1,...,U,) to 
be 


A es — ao 
e- 1s35% (c, u;) 


where c ranges over all unit vectors. The vector c yielding the min- 
imum is called the handle of the representation. Let 6(G) denote 
the minimum value over all repesentations of G. It is easy to see 
that this minimum is attained. A representation with value 6(G) 
is called optimal. 
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Lemma 31.5. 6(G ® H) < 0(G)60(#). 


Proor: Let (uj,...,u,) and (v),...,Vm) be optimal orthonormal 
representations of G and H, with handles c and d, respectively. 
Then, by (31.2), cod is a unit vector, and again by (31.2), 


1 
ef 
OMG te a (cod, u; 0° v;)? 


= Max : : : 
Ta (eu? (dv, 
= 6(G)6(H). 


(One can prove that equality holds in this lemma.) 
Theorem 31.6. O(G) < 0(G). 


PROOF: We first show that a(G) < 6(G). Let (ui,...,u,) be an 
optimal orthonormal representation of G with handle c. Suppose 
that {1,2,...,4} is a maximum independent set in G and hence 
U,,..., Uz are pairwise orthogonal. Therefore 

k 

1 =|lel? > So (c, ui)? > a(G)/A(G). 

i=1 

From this and Lemma 31.5 we find that a(G") < 0(G") < 0(G)”. 


Example 31.1, continued. We can now show that 0(C;) = V5 
by an elegant idea from Lovasz (1979). Consider an umbrella whose 
handle and five ribs have unit length. If we open such an umbrella, 
we will pass a point where the angles between nonadjacent ribs are 
all 7/2. If we call the handle c and the ribs u;, to us, all oriented 
away from their common point, then uj,...,Us5 is an orthonormal 
representation of Cs in R*. It is an easy calculation to show that 
(c, u;) = 57 !/*. Theorem 31.6 and the definition of (C5) then yield 
the result. 
The result also follows form the corollary below. 


Corollary. Let G be a graph on n vertices which is regular of 
degree d and let \min be the least eigenvalue of G. Then 


=f ata 


< —_—_.. 
AG) ~ d _ Anis 
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PRoor: Let A be the adjacency matrix of G. In the proof of 
Theorem 31.3, we saw that with A := Amin, M := A- AI — Ay 
was positive semidefinite (and singular). Thus M = BB! for some 
real matrix B of rank < n. Let the rows of B be x1,...,xn} we 
have 

d—> d—x 


(5c, 26) =—-A- a ’ (ne) = FS ) 


the latter when 7,7 are not adjacent. Let c be any unit vector 
orthogonal to the rows x; of B and define 


1 1 
Vv; = —=x; + Cc. 
V-A \/—An/(d — X) 
Check that vj,...,V, is an orthogonal representation of G. Finally 
note that for any 7, 
1 _ An 
(ce, vi)? Qe 


The result follows from Theorem 31.6. 


Next, we give two applications of ‘interlacing’ of eigenvalues. 


Lemma 31.8. Let A be asymmetric matrix of order n with eigen- 
values 
At = A2 = we Die 


Suppose N is anm x n real matrix such that NN' = In, som <n. 
Let B:= NAN, and let 


fai ee Se, 


be the eigenvalues of B. Then the eigenvalues of B interlace those 
of A, in the sense that 


Vi = Li = Nias 


for 1 Dei: 
PROOF: Let e),...,@, be an orthonormal basis of eigenvectors of 
A corresponding to j,..., An, and let f,,...,f,, be an orthonormal 


basis of eigenvectors of B corresponding to j11,..., Um- 
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Fix i and consider U := span{f,,...,f;}. By (31.1), 
xBx! 
xx! 


for every nonzero x € U. Let W := {xN :x € U}. Then W is an 
i-dimensional subspace and 


> [i 


yAy! 
yy! 
for every y € W. Choose y ¥ 0 in WN span{ej, e;41,..., en}. 
Then in addition to (31.3), we have 


(31.3) 


= Li 


yAy! 
yY 
from (31.1) and this proves \; > pj. 
A similar argument with U := span{f;j,...,f,} will prove that 
Li = Anaad 
The term interlacing seems most natural in the case m = n— 1 
where 


Ay > fy 2 Ag > Me 2 AZ 2 & An-1 2 Mn-1 2 An- 


An important special case of Lemma 31.8 occurs when the rows 
of N are m distinct ‘standard basis’ vectors of length n, i.e. with a 
single 1 in each row. In this case the matrix B is just an m x m 
principal submatrix of A. The following observation is due to D. 
M. Cvetkovié. 


Theorem 31.9. The size of a coclique in a graph G cannot exceed 
the number of nonnegative eigenvalues, or the number of nonposi- 
tive eigenvalues, of G. 


PrRooF: If G has a coclique of size m, then the adjacency matrix A 
has an m Xx m™ principal submatrix of all zeros. If the eigenvalues 
of A are \y > Ax > -:: > Ay, then by interlacing, A,, > 0 and 
0 > An—-m41, so A has at least m eigenvalues which are > 0 and at 
least m which are < 0. 


The next theorem is a result of A. J. Hoffman on the chromatic 
number of a graph. We remark that for regular graphs, this would 
follow from Theorem 31.3. 


31. (More) algebraic techniques in graph theory 441 


Theorem 31.10. Let \, >--- >A, be the eigenvalues of a graph 
G. Then 
x(G) 2 1+ A1/(—An)- 


PROOF: Suppose G can be properly colored with m colors. The 
color classes induce a partition of the adjacency matrix 


Au ay Aims 
As), 3 S|, 
Ami we Apr 


where A;; is the submatrix consiting of the rows indexed by the 
vertices of color 7 and columns indexed by the vertices of color j. 
Each of the diagonal blocks Aj; is a square zero matrix. 

Let e be an eigenvector of A corresponding to the maximum 
eigenvalue A, and write e = (e€),...,€,) where e; has coordinates 
indexed by the vertices of color 7. Let 


1 
Ter © he 0 
Nex 0 Teal ©2 0 
= 0 0 To E3 


(an m x n matrix) and let B := NAN'. By Lemma 31.8, the 
eigenvalues [1,..., 4m of B interlace those of A and hence are be- 
tween A; and X,. On the other hand, we have constructed B so 
that A, is an eigenvalue since 


(ller|],---, llemll)B =eAN' = AyeN' = Aj ((lei|,--- llemll)- 
Finally note that the diagonal entries of B are zeros, so 
0 = trace(B) = py +--+ + pm > AL + (M=—1)Aq. 


This proves the theorem. (But perhaps we should have said that if 
some of the e; are zero, those rows should not be included in N.) 
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Problem 31D. Show that for any finite graph G, y(G) <14+ A; 
where , is the largest eigenvalue of G. Hint: consider the degrees 
of an induced subgraph of G that is minimal with respect to having 
the same chromatic number as G. 


In addition to being symmetric, the adjacency matrix of a graph 
is also a nonnegative matrix. There is a useful theory of such matri- 
ces, at the center of which is the Perron—Frobenius theorem, The- 
orem 31.11 below. A complete proof may be found in Gantmacher 
(1959). 

A square matrix A with rows and columns indexed by a set X, 
say, is called irreducible when it is not possible to find a proper 
subset S of X so that A(z, y) = 0 whenever x € S andye€ X\S. 
Equivalently, A is not irreducible if and only if it is possible to apply 
a simultaneous row and column permutation to obtain a matrix of 


the form 
B O 
C D 


where B and D are square and of order at least 1. It should be clear 
that if A is the incidence matrix of a graph, then it is irreducible 
if and only if the graph is connected. 


Problem 31E. Let D be a finite digraph. Let A the adjacency 
matrix of D, or any matrix with rows and columns indexed by the 
vertices of D and where A(z, y) = 0 if there are no edges directed 
from x to y and A(xz,y) > 0 if there are such edges. (i) Show that 
A*(x,y) > 0 if and only if there is a directed walk of length k from 
x to y in D. (ii) Show that A is irreducible if and only if D is 
strongly connected, that is, if and only if for any two vertices of D, 
there is a directed path from x to y. 


Theorem 31.11 (Perron—Frobenius). Let A be an irreducible 
nm X n nonnegative matrix. There is, up to scalar multiples, a 
unique eigenvector a = (a1,...,@,) all of whose coordinates a; are 
nonnegative. The coordinates of this vector a are, in fact, strictly 
positive. The corresponding eigenvalue (which is called the dom- 
inant eigenvalue of A) has algebraic multiplicity 1 and has the 
property that \ > || for any eigenvalue pu of A. 
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PARTIAL PROOF: We omit the proof of the existence of a nonnega- 
tive eigenvector, but show how irreducibility is used and give much 
of the rest of the proof. 

We will show that any nonnegative eigenvector a of A, corre- 
sponding to eigenvalue , say, is positive in all coordinates. Con- 
sider A as the matrix of a strongly connected digraph as in Problem 
31E. Since there is a directed path from any vertex to any other of 
length <n —1, 


LAs Ae AP sO, 


Here and below, an inequality for matrices or vectors is understood 
to hold in all coordinates. Since a > 0 but a 4 0, 


O<a(f+A+A274+---4 A771) =(14A4--- +A” 4a, 


and so 0 <a. 

Next we show the eigenvalue \ has geometric multiplicity 1, i.e. 
that the corresponding eigenspace has dimension 1. Let a’ be any 
eigenvector corresponding to eigenvalue A, with coordinates that 
are positive, negative, or zero. If we add or subtract small scalar 
multiples of a’ from a, we obtain other positive or nonnegative 
eigenvectors a — ca’ corresponding to eigenvalue A. We can choose 
cso that a+ca’ > 0, but so that at least one coordinate is zero. This 
contradicts the result of the previous paragraph unless a— ca’ = 0. 
We omit the proof that \ has algebraic multiplicity 1, i-e. is a simple 
zero of the characteristic polynomial. 

Let a be the positive eigenvector for A, with eigenvalue 4. Let b 
be the positive eigenvector of A'; say Ab' = vb!'. Then 


dab! = (aA)b! = a(Ab!) =vab!. 


This is a contradiction unless vy = \, so Ab! = Ab'. Suppose u is 
an eigenvector of A that is not a scalar multiple of a, say uA = pu 
with u ~A A. Then 


ub! = u(Ab') = (uA)b! = pub'. 
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which implies ub' = 0. So any other eigenvector u cannot have 
nonnegative coordinates. Now let t := (|uj|,...,|u,/) and check 
that uA > ||U follows easily from A > O. Then 


ab! = a(Ab') = (@A)b! > |p| ab", 


from which we have \ > ||. 


Theorem 31.12. Let \ be the dominant eigenvalue of a connected 
graph G. Then G is bipartite if and only if —A is also an eigenvalue 
of G. 


PROOF: First suppose G is bipartite. Then the adjacency matrix 
A of G is, after renumbering the vertices if necessary so that the 
color classes consist of the first k and last n — k, of the form 


O B 
4=(5 6) 


where B is k x n—k. Let e be an eigenvector corresponding to 
an eigenvalue 4 and write e = (e),e2) where e; consists of the 
first k coordinates. It may be checked that (e1,—e2) is then an 
eigenvector of eigenvalue —j. So if G is bipartite (connected or 
not), its spectrum is in fact symmetric about 0. 

Now suppose A is the dominant eigenvalue of a connected graph 
G and that —A is also an eigenvalue. Let e = (e;,—e2) be an 
eigenvector of unit length corresponding to —A, where we have 
numbered the vertices so that e; and eg both have all nonnegative 
coordinates. Partition the adjacency matrix A accordingly: 


Bs iG 
Ge) 
Then 
—\ = eAe' =e,Be}] + e,He, — e,Ce, — e,De}, 


and hence 


(31.4) (e1, e2) A(e1, e) | > A 
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with equality if and only if 
(31.5) e, Be, = eEe, = 0. 


The equation (31.1) and the fact that A is the dominant eigenvalue 
imply that equality must hold in (31.4) and that (e1, e2) is an eigen- 
vector corresponding to eigenvalue A. Then Theorem 31.11 implies 
that all coordinates of (e;,e2) are positive. Finally, (31.5) shows 
that B = E =O and this means that G is bipartite. 


Theorem 31.13. Let G be a finite graph and A its adjacency 
matrix. There exists a polynomial f(x) such that f(A) = J if and 
only if G is connected and regular. 


PROOF: Suppose f(A) = J for some polynomial f(x). Then, since 
polynomials in a given matrix commute, AJ = JA. The entry in 
row x and column y of AJ is degg(x); the entry in that position of 
JA is degg(y). Hence G is regular. 

If f(A) = J, then, in particular, for any x and y, we must have 
A*(x,y) #0 for some k. The entry in row x and column y of A* 
is the number of walks of length k from x to y in G. Hence G is 
connected. 

Now suppose G is connected and regular of degree d. Then j is an 
eigenvector of A corresponding to eigenvalue d. By Theorem 31.11, 
the eigenvalue d has multiplicity one. For any symmetric matrix M, 
the matrix of the orthogonal projection onto any of its eigenspaces 
is a polynomial in M; explicitly, if (a — 1)(@ — w2)--+ (a — pg) is 
the minimal polynomial of /, then it is an exercise to show that 


1 
(Ha — fe) +++ (t41 — Be) 


(MS jot) se Me) 


is the orthogonal projection onto the eigenspace {a: aM = py,a}. 
Since the orthogonal projection onto the span of j is J , this matrix 
is a polynomial in A. 


Isomorphic graphs will have the same eigenvalues, but, as we 
mentioned earlier, the converse is not true: nonisomorphic graphs 
may have the same spectra (multisets of eigenvalues). See Problem 
210. But we may have a partial converse, in particular for ‘circulant 
graphs’ with a prime number of vertices. 
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Let G be an abelian group, written additively. For S C G, the 
Cayley graph T(G,S) has vertex set G, and there is an edge directed 
from xz to y in [(G, S) if and only if y— ax € S. Cayley graphs have 
arisen numerous times in the preceding chapters. A Cayley graph 
may be thought of as an undirected graph when —S = S, and will 
have no loops when 0 ¢ S. A circulant graph is a Cayley graph 
I(G,S) where G is cyclic. 

(Cayley graphs can also be defined for nonabelian groups. But 
we consider only the abelian case below.) 

A G-matrix is a square matrix A whose rows and columns are 
indexed by the elements of G and where for some vector of objects 
(ag: 9 € G), A(i,j) = aj. A circulant matrix (or simply a 
circulant) is a G-matrix where G is cyclic. In general, the adjacency 
matrix of a Cayley graph based on G is a G-matrix. 

We remark that G-matrices over a ring R are closed under mul- 
tiplication and the algebra of such matrices is isomorphic to the 
group ring R/G] as in Chapter 28. 


Proposition 31.14. Let w bea primitive nth root of unity in some 
field F and U the matrix with rows and columns indexed by Z,, with 
U(i,j) =w". Let A be a circulant based on elements (a; : i € Zn) 
of F. Then UAU™~ is the diagonal matrix with diagonal entries 


n-1 


2 Or a 
i=0 


That is, the eigenvalues of A are Xo, 1,.--,;An—1 (and the corre- 
sponding eigenvectors are the rows of U). 


Problem 31F. (i) Prove Proposition 31.14. (ii) For those familiar 
with the characters of an abelian group: Show that the eigenvalues 
of a rational G-matrix A are A, = Da aeG agxX(g), aS X ranges over 
the characters of G. Cf. Example 30.6. 


The following is from Elspas and Turner (1970). 


Theorem 31.15. Let A and B be rational circulant matrices of 
prime order p > 2, based on vectors (a; : i € Zp) and (b; : i € Zp), 
respectively. If A and B have the same spectrum, then for some 
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t £0 in Z,, a; = by for alli € Z,, where the subscripts are read 
modulo p. 


PROOF: Let w be a complex primitive pth root of unity. Then, in 
view of Proposition 31.14, a := een a;w' is an eigenvalue of A and 
so is one of the eigenvalues of B, and so 


n-1 n-1 


y aw’ = 5 byw 
i=0 i=0 


for some s. 
First, assume s 4 0, and choose t so that st = 1 (mod p). Then 


p-l1 
(a; _ bi)w" = 0: 


But as the minimal polynomial of w over the rationals is 1+. 2+2a?+ 
---+a?-! we may conclude that for some constant c, a;—b;; = c for 
alli. The traces of A and B, which must be equal, are, respectively, 
pag and pbo. Hence ap = bo, and then c = 0. 

If s = 0, then a is rational and 


(a9 — a) taqw+-+++a,1w?! = 0. 


We may concude that all coefficients above are equal to some con- 
stant c; this means A= al+cJ. 

We repeat the argument with the roles of A and B interchanged, 
and we find that either the statement of the theorem holds or that 
B=(I+dJ for some rational numbers (@ and d. 

The eigenvalues of al + bJ are a+ pb of multiplicity 1 and a+b 
of multiplicity p —1. If A = al +cJ and B = GI +dJ have the 
same spectra and p > 2, it follows that A = B and the theorem 
holds for any t. 


We remark that for p = 2, the statement of the theorem is not 
valid, since e.g. the circulants 


a) a a) 


have the same spectrum {2,0}. However, it is valid if A and B are 
both nonnegative. 
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Corollary. Circulant graphs of prime order are isomorphic if and 
only if they (their adjacency matrices) have the same eigenvalues. 


PROOF: We have asserted before that isomorphic graphs have the 
same spectra. 

Suppose the circulant graphs ['(Z,,S) and ['(Z,,T), for some 
S,T C Z,, p prime, have adjacency matrices A and B with the 
same spectra. Here A is the circulant based on (a; : i € Z,) where 
a; = 1 if 7 € S and a; = O otherwise; similarly, and B is based on 
(b; : 2 © Z,) where b; = 1 if i € T and b; = 0 otherwise. Theorem 
31.15 says (for p = 2, we need the remark) a; = b;; for some t 4 0. 
Then the permutation ¢ : i + ti of Z, is an isomorphism of I'(Z,, S) 
onto I'(Z,,T) because 


i v~jinT(Z,S)<—i-jes 
=> e797) eT SS t= fy in T(Z,,T), 


where x — y means there is an edge directed from x to y. 


Corollary. Given S,T C Z,, the circulant graphs 1'(Z,,S) and 
I'\(Z,,T) are isomorphic if and only if T = tS for some ‘multiplier’ 
t £0 in Z. 


Proor: If [(Z,, 5S) and I'(Z,,T) are isomorphic, then they have 
the same spectra, and as in the proof of the preceding corollary, we 
have a; = b;; for some t 4 0, which means T = tS. 

Conversely, if T = tS, then as we saw above, @: i +> ti of Zy is 
an isomorphism. 


It had been conjectured that the corollary above holds when p 
is replaced by any integer n, and the condition t 4 0 replaced 
by (t,n) = 1, but counterexamples exist for n = 8,9 and other 
values. In fact, the corollary with p replaced by n holds exactly 
when n is square-free or 4 times an odd square-free number. See 
M. Muzychuk (1997). 


Problem 31G. Prove that the vertices of any finite simple graph 
may be colored red and blue so that each red vertex is adjacent to an 
even number of red vertices, and each blue vertex is adjacent to an 
odd number of red vertices. First prove the lemma: if S = (aj;;) isa 
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symmetric (0,1)-matrix, then the diagonal a = (a1, 22,..-,@nn), 
considered as a column-vector, is in the span of the columns of S 
over the field Fo. 


Problem 31H. Let G be a connected regular graph with exactly 
three distinct eigenvalues. Show that G is strongly regular. 


Problem 311. Let G be a nontrivial strongly regular graph and 
for x € V(G), let A(x) denote the subgraph induced by the vertices 
nonadjacent to x. Prove that A(a) is connected by (1) explaining 
why &— would be an eigenvalue of G if A(x) were not connected, 
and (2) showing by calculation that /— y is not an eigenvalue of G. 


Problem 31J. Suppose the Petersen graph P is the union of span- 
ning subgraphs H, K, that are regular of degrees 1 and 2, respec- 
tively. Let A be the adjacency matrix of P and B, C the adjacency 
matrices of H and K, so that A= B+C. 

Show that there exists a vector u that is an eigenvector of A corre- 
sponding to eigenvalue 1 and also an eigenvector of B corresponding 
to eigenvalue —1 (and hence an eigenvector of C' corresponding to 
eigenvalue 2). Next, explain why K cannot be a connected graph. 
(This shows that P is not Hamiltonian.) 


Problem 31K. Let G be a directed graph on n vertices without 
multiple edges. We allow edges a — b and 6 — a to occur simul- 
taneously in which case we call {a, b} an undirected edge. Suppose 
that for any two vertices a,b there is exactly one walk of length 3 
from a to b if a # b and no such walk if a = b. If A is the adjacency 
matrix of G, then this property is expressed by the equation 


(31.6) A= JI. 


(i) Prove that G is regular; 
(ii) If ¢ is the valency of G, show that n = c? + 1; 
(iii) Prove that G has exactly $(C +c) undirected edges; 
(iv) Construct an example of such a graph if n = 9. 
Notes. 
Theorem 31.3 was never published by Hoffman, one of the first 
‘algebraic graph theorists’, but caught the fancy of many of those 
who heard of it, and has become a classic. 
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Problem 31D is a result of H. Wilf. Problem 31A is an unpub- 
lished result of A. J. Schwenk. 
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Graph connectivity 


For k > 2, a graph G is said to be k-vertex connected, or simply k- 
connected, when |V(G)| > k+1 and the removal of any k—1 vertices 
(and any incident edges) from G does not result in a disconnected 
graph. We use 1-connected as a synonym for connected. 

If a graph G with at least k + 1 vertices is not k-connected and 
the deletion of a set S of k — 1 vertices disconnects it, there is 
a partition of V(G) \ S into nonempty sets X,Y with no edges 
crossing (one end in X, one in Y). Let H and K be the subgraphs 
induced by X US and Y US, except that edges with both ends in 
S are to be put in one and only one of H or K. Then we obtain 
edge-disjoint subgraphs H and K whose union is G and such that 
|\V(H)AV (K)| = k—-1. Conversely, if such subgraphs H and K exist 
and each contains at least one more vertex than their intersection, 
then G is not k-connected. 

A graph is said to be nonseparable when it is 2-connected and has 
no loops, or when it is a bond-graph (with two vertices and any pos- 
itive number of edges joining them, including the link-graph with 
one such edge), a loop-graph (one edge joining a single vertex to 
itself), or a vertex-graph. All polygons, for example, are nonsepa- 
rable; path-graphs or other trees with at least two edges are not. 

(We do not want loops, if there are other edges, because while 
the deletion of the incident vertex may not disconnect the graph in 
a combinatorial sense, it does in a topological sense. With another 
type of 2-connectivity, that of Tutte, we would not need to prohibit 
loops, or allow the small graphs, explicitly; see Problem 32D.) 

Here is a small observation about the ‘structure’ of nonseparable 
graphs. 
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Lemma 32.1. Let G be a finite nonseparable graph with at least 
two edges and H a maximal proper nonseparable subgraph of G. 
Then G is the union of H and a path-graph P that joins distinct 
vertices of H and has none of its internal vertices in H. 


The path in the statement of the lemma may be called a ‘handle’. 
Figure 32.1 is suggestive. 


< 


Figure 32.1 


Proor: It may be that V(H) = V(G). In this case, let e be an 
edge in G but not H and let P be the link-graph consisting of e 
and its ends. Since adding e to H produces a nonseparable graph 
and H is maximal among the proper nonseparable subgraphs, the 
union of H and P is G. 

If V(H) 4 V(G), the connectivity of G implies that there exists 
an edge e with one end x € V(#) and the other y ¢ V(#H). By the 
connectivity of G — x, there exists a simple path that does not pass 
through x, joining y to some other vertex w of H. If z is the first 
vertex of H on this path (as we proceed from y to w), the edge e 
and the initial part of this path provide a path-graph P joining x 
to z that has no edge terms in H and no vertex terms in H other 
than its ends. Then H U P is a nonseparable graph larger than H, 
and so is equal to G. 


As a consequence of the lemma, given a finite nonseparable graph 
G with at least two edges, there is a sequence of nonseparable 
subgraphs 


where Ho is a link-graph and where each Hj+; is the union of H; 
and a handle. 

A subdivision of a graph G is, informally, a graph H obtained 
by inserting extra vertices into some edges. More precisely, we 
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can replace the edges of G by (finite) path-graphs whose internal 
vertices are ‘new’, i.e. not vertices of the original G; and we allow 
the replacing of loops by polygons. Two graphs are said to be 
homeomorphic if they are isomorphic to subdivisions of the same 
graph. Subdivision of a nonseparable graph (other than the link- 
graph) preserves nonseparability. 


Problem 32A. Let us define the rank of a connected graph G as 
|E(G)| — |V(G)| + 1. (in Chapter 34, we will see that this is the 
dimension of what is called there the cycle space of G.) When we 
add a handle to a nonseparable graph, the rank goes up by one. 
Connected graphs of rank 0 are trees. The nonseparable graphs 
of rank 1 are the polygons. The nonseparable graphs of rank 2 
are the 0-graphs, homeomorphic to the letter 0, i.e. subdivisions of 
the bond-graph with three edges, since that is all that can arise by 
adding a handle to a polygon. Describe all nonseparable graphs of 
rank 3 as subdivisions of four graphs. 


It is clear that if G is not nonseparable and H, K are edge-disjoint 
subgraphs with at most one vertex in common, whose union is G, 
then an edge of H and an edge of K cannot be contained together 
in any polygon of G. 


Theorem 32.2. Any two edges in a nonseparable graph G are 
contained in the edge set of a polygon subgraph. 


PROOF: We assert that given distinct vertices x,y and an edge e 
of a nonseparable graph G, there exists a path from x to y which 
traverses e. So if x,y are the ends of another edge e’, we obtain a 
polygon containing e and e’. We prove the assertion by induction 
on the total number of edges (or the rank). 

The link-graph with edge e is a nonseparable subgraph. Let H be 
a maximal proper nonseparable subgraph containing e. If both x 
and y are in H, we are done by the induction hypothesis. Otherwise 
G is the union of H and a path-graph P joining vertices a and b of 
H, say, and at least one of x or y is an internal vertex of P. If both 
x and y are vertices of P, a path from a to b in H that traverses 
e together with segments of P provides the required path. Finally, 
assume x is internal to P and y € V(H). Say y 4a. Then a path 
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in H from y to a that traverses e together with the segment of P 
from x to a provides the required path. 


Corollary. Any two vertices of a nonseparable graph G (other 
than the link-graph) are contained in some polygon in G. 


PROOF: Choose an edge on each vertex and apply the theorem, 
if the edges are distinct. If the edges are the same, there is a 
polygon containing that edge, as long as the graph has a second 
edge somewhere. 


The corollary is actually equivalent to Theorem 32.2, since the 
theorem can be derived by inserting new vertices into the middle 
of the given edges, and applying the corollary to the new vertices. 

The corollary is the special case k = 2 of the following theorem 
of H. Whitney (1932). 


Theorem 32.3. A graph G with at least k +1 vertices is k-vertex 
connected if and only if, for any two distinct vertices x and y of 
G, there exist k internally disjoint paths joining x and y, i.e. paths 
which share no vertices other than their ends x and y. 


This will be an immediate consequence of Menger’s Theorem 
below. We give the latter in the version for directed graphs. As 
usual, an undirected graph may be considered as a digraph when 
we replace each undirected edge by a pair of directed edges, one in 
each direction. 

We will apply Theorem 7.2 on integral flows, and require the 
following observation. (This is almost identical to Problem 7B.) 


Problem 32B. Let f be a nonnegative flow from s to ¢ in a di- 
graph D (capacities are not required). 
(i) Show that if f has positive strength, then there exists a di- 
rected path from s to t all of whose edge terms e have f(e) > 0. 
(ii) Conclude that if f is integral and has strength k, then there 
exist k directed paths p, p2,...,p, from s to t in D such that for all 
edges e € E(D), the number of paths p; that traverse e is < f(e). 


Problem 32C. Let s and t be distinct vertices in a digraph D. 
Show that the maximum number of edge-disjoint directed paths 
from s to t is equal to the minimum number of edges in a set E 
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whose deletion disconnects s and ¢ in that direction, i.e. such that 
every directed path from s to t contains at least one edge from E. 


Let s,t be two distinct vertices in a digraph D. An (s,t)-sep- 
arating set is a subset S C V(D) \ {s,t} so that any directed path 
from s to t contains at least one vertex of S. 


Theorem 32.4 (Menger’s Theorem). Let s,t be two distinct 
vertices in a digraph D and assume there is no edge directed from 
s tot. If no (s,t)-separating set has size less than k, then there 
exist k internally disjoint directed paths from s to t in D. 


PROOF: We construct a transportation network N whose vertices 
are s, t, and a pair of new vertices 71,22 for each vertex x of D 
other than s,t. For each edge of D directed from s to a vertex x of 
D, there is an edge of N directed from s to x, of infinite capacity 
(or any integer capacity > k). For each edge of D directed from a 
vertex x of D to t, there is an edge of N directed from x2 to t of 
infinite capacity. For each edge e of D directed from x to y, where 
x,y are vertices of D other than s,t, there is an edge directed from 
X2 to y, of infinite capacity. Finally, for each vertex x of D other 
than s,t, there is an edge directed from x; to x2 of capacity 1. 

Note that, by construction, a directed path from s to t in N must 
have vertex terms of the form 


S, Q1, Aa, bi, bo, ..., 21, 2a, 


where s,a,b,...,z,t are the vertex terms of some directed path 
from s to t in D. 

Suppose we have an integral flow f from s to tin N of strength k. 
The values of f can only be 0 and 1, because every edge of N either 
enters a vertex x, for which the only edge leaving has capacity 1, 
leaves a vertex 2X2 for which the only edge entering has capacity 1, 
or is an edge x1 — 2X2 of capacity 1. By Problem 32B, there exists a 
set of & directed paths from s to tin N that pass through any edge 
e at most once. These paths are edge-disjoint and hence internally 
disjoint, because two directed paths sharing a vertex x; or ®2 must 
share that single edge x; — 22. So the corresponding k paths in D 
are internally disjoint. 
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To ensure that such a flow f exists, we consider a cut (S,7) in 
N separating s and t and suppose, for contradiction, that the cut 
has capacity < k. The only edges of N with capacity possibly less 
than & are those directed from x; to x2 for some x € V(D). If we 
let V be the set of vertices x of D such that 7; € S and x2 € T, 
then the capacity of the cut is |V|. Every directed path from s to t 
in N must pass through one of these edges 7, — x. Equivalently, 
every directed path in D must pass through a vertex x of V, so V 
is an (s,t)-separating set. If the hypothesis of the theorem holds, 
we have a contradiction. 


A convex polytope P in R” is the convex hull of finitely many 
points, i.e. 


P={Aixit--++ Aux: XG > 0, Ar te-: + AR = 1}. 


The dimension of a polytope is the dimension of its affine span. 
A supporting hyperplane of P is a hyperplane 


He xe ac) eh 


(where is a nonzero linear functional R” — R and © a scalar) 
with HM P # — but such that P lies entirely in one of the closed 
halfspaces of H, say 


PC {x: (x) > c}. 


A face of P is the intersection of a supporting hyperplane of P with 
P (though the empty set is often included as a face). A vertex of P 
is a 0-dimensional face; an edge of P is a 1-dimensional face. The 
graph G of a polytope P (also called the 1-skeleton of P) has V(G) 
and E(G) equal to the vertices and edges, respectively, of P with 
incidence determined by inclusion. 

A face F' is a convex polytope in its own right; indeed, it is the 
convex hull of the vertices of P that it contains (see Griinbaum, 
1967). The graph of F is a subgraph of the graph of P. 
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The following result is due to M. Balinski (1961). For the proof, 
we require the following fact from the theory of linear programming; 
see Griinbaum (1967). If s is a vertex of a polytope P in R™” and ¢ 
a linear functional, either ¢(s) > (x) for all x € P or there exists 
a vertex t of P adjacent to s such that ¢(t) > &(s). 


Theorem 32.5. The graph G of an n-dimensional convex polytope 
is n-connected. 


PROOF: We will use induction on n. A O0-dimensional polytope 
consists of a single point. The graph of a 1-dimensional polytope 
consists of two vertices and an edge joining them. 

Let £1, %9,...,%p_1,a,b be distinct vertices in the graph G of 
a convex polytope P of dimension n. We may assume P C R”. 
We want to find a walk in G from a to b that avoids the vertices 
L1,-+-,;Lp—1. Choose an (n—2)-dimensional affine space S$ that con- 
tains %1,...,2%nj_1. Some hyperplane H, i.e. an (n — 1)-dimensional 
affine subspace, on S' is such that both a and b are on the same 
side of H, i.e. H = {z: €(z) =a} and &(a) > a, €(b) > a, for some 
linear functional @ on R” and some scalar a. (We allow the case 
that one or both points are in H.) 

Since P ¢ H, we can assume P contains points z with &(z) > a 
(otherwise a and b are in H and we can replace @ and a by their 
negatives—i.e. consider the other side of H). 

By the above fact from linear programming, there exist paths p 
and q in G from a and b to vertices a’ and b’, respectively, at which 
é attains its maximum value a’, and such that no vertices of p and 
q other than their initial vertices lie in H. The intersection PMH", 
where H' := {z : €(z) = ’}, is a face of P of dimension at most 
nm — 1, and in particular contains a connected graph, so there is a 
path r in G from a’ to b! with vertices in H’. We concatenate p, r, 
and the reverse of q to find a walk from a to b. 


Problem 32D. A graph G is said to be k- Tutte-connected when 
for every £ < k, it is not possible to find a pair (H, Kk) of nonempty 
edge-disjoint subgraphs of G, each containing at least @ edges, such 
that HU K = G and |V(H)N V(K)| = @. Such a pair of sub- 
graphs may be called an ¢-separation of G. The concepts of vertex- 
connectivity and Tutte-connectivity differ. But it is easy to see, for 
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a nonnull graph, that 1-Tutte-connectivity is the same as ordinary 
connectivity and that 2-Tutte-connectivity is the same as nonsepa- 
rability. A 2-Tutte-connected graph with at least two edges cannot 
have any loops, since otherwise we have two edge-disjoint subgraphs 
(one a loop-graph and the other with all other edges), each with at 
least one edge, but with only one vertex in common. Prove that for 
a graph G with |V(G)| > k+1, G is k-Tutte-connected if and only 
if G is k-vertex connected and has no polygons with fewer than k 
edges. 


Notes. 

See the notes to Chapter 34 for some remarks on H. Whitney. 

Karl Menger (1902-1985) was an Austrian mathematician who 
left Austria in 1937 for the United States, where he spent the rest 
of his life. He worked in many fields, including logic, didactics, and 
economics, but is best known for his work on dimension theory and 
curve theory. After his Ph.D., he worked with L. E. J. Brouwer 
in Amsterdam to return to Vienna as professor in 1927. There 
he founded the Vienna Mathematical Colloquium that had such 
famous members as K. Gédel and A. Wald and a great number of 
visitors. Their results were published in eight volumes before the 
seminar was forced to stop because of its many Jewish contributors. 
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Planarity and coloring 


We start with some material on deletion and contraction. We shall 
see in the next chapter that these are ‘dual’ concepts; cf. Problem 
34C. 

Let G be a graph and e € E(G). We define two graphs G’, and 
G” with the same edge set E(G)\{e}. The first G! is obtained from 
G by deleting the edge e. (It is sometimes convenient subsequently 
to remove any isolated vertices that may arise, but we shall allow 
them to remain.) The second G! is obtained by identifying the 
endpoints of e and removing e itself. We can think of ‘reeling in’ 
or ‘shrinking’ an edge in a drawing of a graph as in Fig. 33.1. 


Poke K+ PK 
Figure 33.1 


The chromatic polynomial xq of a finite graph G was introduced 
in Chapter 25: y¢(A) is the number of proper colorings of G with 
A colors. For example, note that 


XK, (A) = AQ — 1)(A - 2)---(A-—n+1). 


That yg is a polynomial in A was shown in Chapter 25—also see 
the remark following (33.1). 


Example 33.1. We claim that for a tree T' on n vertices, 


xr(A) = (A= 1)72. 
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To see this, observe that if x is a monovalent vertex of a tree T, 
there are \—1 colors available for x after the tree on n — 1 vertices, 
obtained by removing 2, is given any proper coloring. The claim 
follows by induction on the number of vertices. 


If e is an edge of G, not a loop, we can break the proper colorings 
of Gi‘, into two classes: those in which the endpoints of e have 
different colors (these are proper colorings of G') and those in which 
the endpoints of e have the same color (and these are in one-to-one 
correspondence with the proper colorings of G’”’). This establishes 


(33.1) xa(A) = xa, (A) — xe (A)- 


The equation (33.1) provides another way of seeing that y¢(A) is 
a polynomial in A. A graph H with loops has no proper colorings, 
i.e. yy (A) = 0. For an edgeless graph H on n vertices, yH(A) = X”. 
These are both polynomials. Now use (33.1) and induction on the 
number of edges. 

The chromatic polynomial of a tree T could also be obtained by 
induction and (33.1): T.’ is a tree and T! has two components that 
are trees. 


Problem 33A. Let 7(G) denote the number of spanning trees in 
a graph G (this is sometimes called the complexity of G). Show 
that for any nonloop e € E(G), r(G) = 7(G1) +. r(G%). 


Problem 33B. Find the chromatic polynomial of the n-gon C;,. 
Find the chromatic polynomial of the n-wheel W,, (this is the graph 
obtained from C,, by adding a new vertex and joining it to all 
vertices of C,,). 


It makes no sense to ask for the number of proper colorings of 
a graph with —1 colors, but since yq(A) is a polynomial, we can 
evaluate yg(—1). R. P. Stanley (1973) discovered the combinatorial 
interpretation of |yg(—1)| that follows. An orientation of a graph 
is one of the directed graphs arising by choosing for each edge one 
of its endpoints to be the head; the other is the tazl. (So the number 
of orientations of a graph is 2” where m is the number of edges that 
are not loops—although for topological purposes, there are reasons 
why a loop should also be considered to admit two directions.) 
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Theorem 33.1. The number of acyclic orientations, i.e. orien- 
tations in which there are no directed circuits, of a graph G is 


DEO xe): 


Proor: If G has loops, then yg(A) = 0 and G has no acyclic 
orientations, so the theorem holds in this case. 

Consider an acyclic orientation of G’, where e is not a loop. There 
are two ways to direct e to get an orientation of G. We claim that 
one or both of them are acyclic, and that the cases where both are 
acyclic are in one-to-one correspondence with acyclic orientations 
of G”’. This is because directing e from one end zx to the other y 
produces a nonacyclic orientation if and only if there is a directed 
path from y to x in G/; there cannot be directed paths from x to y 
and also from y to x in G’, since we assumed an acyclic orientation; 
and there is no directed path from y to x in G’, if and only if 
identifying x and y produces an acyclic orientation of G”. 

Thus 

w(G) = w(G.) + w(G2) 


when w(H) is taken to be the number of acyclic orientations of a 
graph H. But by (33.1), this same recursion is also satisfied when 
w(H) is taken to be (—1)!")ly7(—1). The theorem follows by 
induction on the number of edges after we check that it is valid for 
edgeless graphs. 


By an embedding or proper drawing of a graph G on a surface 
S, we mean a drawing where the edges do not cross but meet only 
at vertices. More precisely, we mean a representation of the graph 
where the vertices correspond to points in S, the edges correspond 
to Jordan arcs (continuous one-to-one images of the unit interval) 
in S that join the points of S corresponding to the ends of the edge 
in G, and where no internal point of any of the Jordan arcs is a 
point of any other Jordan arc or one of the points corresponding 
to the vertices of G. We consider only embeddings of finite graphs. 
While the plane is not compact, it can be naturally embedded in 
the sphere (its one-point compactification) and a graph is planar if 
and only if it admits an embedding on the sphere. 

Examples of graphs that are not planar include K33 and Ks; we 
will mention several ways to see this later. Subdivisions of these 
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graphs, and any graph containing one of these subdivisions, will also 
not be planar. An important theorem that we shall not prove here 
is the following. Proofs may be found in Chartrand and Lesniak 
(1986), Tutte (1984), and Diestel (1997). 


Theorem 33.2 (Kuratowski). A graph G is planar if and only 
if it has no subgraphs isomorphic to subdivisions of Ks or K33. 


Problem 33C. Find a subdivision of K33 as a subgraph of the 
Petersen graph. 


A minor of a graph G is a graph H obtained from G by delet- 
ing and contracting edges, and then possibily deleting some or all 
isolated vertices. 

It can be seen that that the order in which edges are deleted 
or contracted is not important. For example, if the edges of a 
subset S C E(G) are contracted in any order, we obtain a graph 
isomorphic to the graph G‘%, called a contraction of G, defined as 
follows. The vertices of G4 are, formally, the components of the 
spanning subgraph, denoted G:S, of G with edge set S. The edge 
set of GG is E(G) \ S. The ends of an edge e € E(G) \ S are, in 

%, to be the components of G:S that contain the ends of e as an 
edge of G. Informally, the components of G:S' are being shrunk to 
single vertices. 

We use the notation G’, for the graph obtained by deleting the 
edges in S, ie. G:(E(G) \ S), but it is convenient to require that 
any isolated vertices are also deleted when using this notation. 


Example 33.2. Let A be the five ‘spokes’ in the usual drawing of 
the Petersen graph P, on the left of Fig. 1.4. The subgraph P:A 
has five components, so P/ will have five vertices. The contraction 

% in this case is Ks. If B is the set of five edges in the interior 
pentagram, then P:B has six components and P3 is the wheel Ws. 
If C is the edge set of a spanning tree of P, then P% consists of a 
single vertex and six loops. (A graph with one vertex may be called 


a cotree.) 


Note that it is quite likely that G'j will have loops and/or parallel 
edges even if G is simple. (Those contractions of a simple graph 
G that have no loops are in one-to-one correspondence with the 
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elements of the lattices H(G), sublattices of the partition lattices, 
introduced in Example 23.6). 

A graph G is a contraction of any of its subdivisions H, since 
contracting any k — 1 edges of a path-graph with k edges produces 
a link-graph. 

If we have a drawing of a graph G on a surface S and e € E(G), it 
is clear that there is a natural drawing of G’, on S. Ife is a nonloop, 
there is also a natural induced drawing of G” on S: the ends of the 
edge can be continuously brought closer on the surface (see Fig. 
33.1). (According to our combinatorial definition, contracting a 
loop is the same as deleting it.) In particular, if G admits an 
embedding on a surface S, then so does every minor of G. 


Problem 33D. Show that a graph G that has a minor isomorphic 
to Ks or K33 also contains a subdivision of Ks or 33. Somewhat 
more generally, prove: 

(i) if a trivalent graph H occurs as a minor of a graph G, then G 
contains (as a subgraph) a subdivision of H. So if K33 is a minor, 
then a subdivision of K33 is a subgraph; 

(ii) if Ks is a minor of G, then G contains a subdivision of Ks or 
a subdivision of K3 3. 


A deep and extremely important theorem due to N. Robertson 
and P. Seymour states that in every infinite set of graphs there are 
two such that one is a minor of the other. See Chapter 12 of Diestel 
(1997) for a discussion of part of this work. It follows that the set of 
graphs that do not embed on a given surface and are minimal with 
respect to this property (every proper minor being embeddable) is 
finite. For the plane, there are two. 

Given an embedding of a graph G on a surface S, we define 
the faces or regions with respect to the embedding to be the topo- 
logically connected components that result when the vertices and 
edges (more precisely, the points corresponding to vertices and sets 
of points in the Jordan arcs corresponding to edges) are removed 
from S. We denote the set of faces by F'(G). We realize that this 
is horrible notation, since the faces depend on the embedding (in 
particular, on the surface) and are not in general determined by G. 
But we will use it anyway. 
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In general, these faces may or may not be 2-cells (homeomorphic 
to the open unit disk, i.e. without ‘holes’). But for an embedding of 
a connected graph on the sphere, the regions will be 2-cells. This 
fact is related to the Jordan curve theorem which asserts that if 
the points of a one-to-one continuous image of the unit circle in the 
sphere are removed, the resulting topological space has exactly two 
simply connected components that are 2-cells. 


Theorem 33.3 (Euler’s formula). For a drawing of a connected 
planar graph G on the sphere or plane, 


fHe+e=2 


where f, e, and v are, respectively, the numbers of faces, edges, and 
vertices. More generally, for any graph G embedded on the sphere, 


(33.2) 1—|F(G)| + |E(G)| — |V(G@)| + |C(G)| = 0, 


where C(G) denotes the set of components of G. 


PROOF: We prove the latter assertion by induction on the num- 
ber of edges of G. For an edgeless graph, |C(G)| = |V(G)| and 
|F'(G)| = 1, so (33.2) holds. 

Suppose G has a nonloop a. Consider an embedding of G and 
note that contracting an edge will not affect the number of faces. 
Apply the induction hypothesis to G” to get 


1 — |F(Ga)| + |E(G.)| — |V(Ga)| + |C(G)| = 0. 


We have |E(G)| = |E(Gq)| + 1, [V(G)| = |V(Ga)| +1, |C(@)| = 
IC(G")|, and also |F'(G)| = |F'(G")|. So (33.2) holds for G. 

Suppose G has loops; let a be one of these, and apply the induc- 
tion hypothesis to G’, to get 


1 — |F(G,)| + |E(Ga)| — |V(Ga)| + |C(Ga)| = 9. 


Of course, |E(G)| = |E(Gq)| + 1, |V(@)| = |V(G;)|, and |C(G)| = 
|C(G',)|. On the sphere, the Jordan curve theorem implies that the 
two faces incident with e are distinct, so |F(G)| = |F(G'!,)| +1 and 
we obtain (33.2) for G. 
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Example 33.3. We show that Ks and K33 are not planar. The 
set of edges incident with a face in a planar embedding of a non- 
separable graph G (other than the link-graph) contains a polygon 
subgraph; see Chapter 34. The degree of a face is the number of 
incident edges (i.e. its degree as a vertex of the dual graph of Chap- 
ter 34). The sum of the degrees of the faces is twice the number of 
edges, since every edge is incident with two faces. 

Consider a hypothetical embedding of K; in the plane. By Eu- 
ler’s formula, there are seven faces. But each has degree at least 
3, so the sum of the degrees is at least 21, which, of course, is a 
contradiction to |E(k5)| = 10. 

Similarly, there would be five faces in a hypothetical embedding 
of A33 in the plane, each of degree at least 4 since K33 has girth 
4. Then the sum of the degrees would be at least 20, contradicting 
|E(K3,3)| = 9. 

(A similar argument shows that the Petersen graph is not planar. 
But we already knew this since both Ks; and K33 occur as minors 
of the Petersen graph and they are not planar.) 


Problem 33E. Determine all pairs (d,, dz) of integers with d; > 2, 
i = 1,2, so that there exists a planar graph (not necessarily simple) 
that is regular of degree d; and such that all faces have degree do. 
(This will include the degrees of the vertices and faces of the five 
Platonic solids.) 


E. Steinitz (1922) characterized the graphs of 3-dimensional con- 
vex polytopes. 


Theorem 33.4. A graph G is the graph of a 3-dimensional convex 
polytope if and only if G is simple, planar, and 3-connected. 


That the graph of a 3-dimensional polytope is 3-connected is a 
special case of Theorem 32.5. We do not prove the rest of the 
theorem. 

The question “Can the countries of a map on the sphere be col- 
ored with four or fewer colors so that adjacent countries are colored 
differently?” , occurs in a letter dated October 23, 1852, from Au- 
gustus de Morgan to Sir William Rowan Hamilton. It is likely that 
the originator of the question was the brother, Francis Guthrie, of 
a student, Frederick Guthrie, of de Morgan. 
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Coloring the countries (regions) of a planar map is equivalent to 
coloring the vertices of the dual graph that we will introduce in the 
next chapter. So for us, the ‘four color problem’ is the question of 
whether every planar graph is 4-colorable. 

Four colors are clearly necessary since Ky is planar, but it is easy 
to find many other planar graphs G that cannot be 3-colored. This 
problem has motivated a great deal of graph theory. More on the 
history of the four color problem may be found in Biggs, Lloyd, 
and Wilson (1976). 

In 1890, P. J. Heawood proved the Five Color Theorem, Theo- 
rem 33.6 below: a loopless planar graph G has chromatic number 
x(G) <5. Heawood used ideas of A. B. Kempe (in particular, the 
recoloring idea reviewed below). In 1976, K. Appel and W. Haken 
finally settled the four color problem with an announcement of a 
proof of the Four Color Theorem: .(G) < 4. Surprisingly, perhaps, 
their proof did not use the vast quantity of work and theories de- 
veloped in this century but went back to the ideas of Kempe and 
Heawood. Another surprise, perhaps, is that their proof required 
over 1000 hours of computer time. It was not clear when they be- 
gan that the computer would ever finish. As an oversimplification, 
there were many cases to consider but it was not certain that their 
number would be finite; the computer was programmed to gener- 
ate cases itself and could conceivably have gone on forever breaking 
cases into subcases. See Appel, Haken, and Koch (1977), and Ap- 
pel and Haken (1977). We have no space to discuss this work in 
any detail, but will prove the Five Color Theorem below. 


Proposition 33.5. Every nonnull simple planar graph G has a 
vertex of degree at most 5. 


PROOF: This follows quickly from Euler’s formula. It suffices to 
prove that a component has such a vertex, so assume G' is con- 
nected. Suppose a drawing of G has f regions, e edges, and v 
vertices. If no vertex of degree less than six exists, then the sum of 
the v degrees of the vertices, which is 2e, is at least 6v; that is, 
Os 


w| a 


Since G is simple (and we may assume it is not a link-graph with 
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two vertices joined by a single edge), each region is incident with 
at least three edges, so the sum of the f degrees of vertices of G*, 
which is 2e, is at least 3v; that is 


But then we have the contradiction 


e 2e 
a= —_— ft — =); 
ts ear ee 


Of course, coloring a loopless graph is equivalent to coloring a 
simple graph (just suppress the multiple edges). An immediate 
consequence of the above result is the Six Color Theorem: a loopless 
planar graph G has y(G) < 6. When a vertex x of degree at most 
five is deleted from a simple planar graph and the result colored 
with six colors inductively, there is always a color available for x. 

We recall the idea of recoloring introduced in Chapter 3. Suppose 
G is properly colored and let a and G be two of the colors. The 
colors a and (3 can be switched on any component of the subgraph 
of G induced by the vertices of colors a and 7 to obtain another 
proper coloring. So given two vertices x and y of colors a and 
GB, respectively, either there is a path (of odd length) with vertex 
terms ay = £,Q1,...,@, = y that alternate colors a and ( (a so- 
called Kempe chain), or there is a proper coloring in which both x 
and y have color a, say, while vertices of colors other than a and 
G retain the same color. 


Theorem 33.6. If G is a loopless planar graph, then x(G) < 5. 


PROOF: We use induction on the number of vertices. The assertion 
is surely true for graphs with at most five vertices. Consider a 
planar drawing of a loopless graph G and assume the theorem holds 
for all graphs with one vertex less. We can assume G is simple. 
By Proposition 33.5, there exists a vertex « € V(G) of degree 
<5. Let G, denote the graph obtained from G by deleting the 
vertex x and the incident edges. We have a natural planar drawing 
of G, obtained from the drawing of G. By the induction hypothesis, 
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G, has a proper coloring with five colors. There is a color available 
for x, and we are done, unless x has degree exactly five and the 
vertices y1, Y2,---,Y5 (which we write in the cyclic order given by 
the drawing) adjacent to x in G have received five distinct colors 
in the coloring of G,;. So we assume that y; has received color i, 
4=1,2,3,4,5. See Fig. 33.2. 

Consider colors 1 and 3. Either there exists a proper coloring of 
G, in which y, and y3 both have color 1 (while yo, yz, and ys retain 
their original colors) or there exists a simple path with vertex terms 


(33.3) Yi, 41, by, Aa, be, ..., as, Ds, Y3; 


say, in G, with vertices alternately colored 1 and 3. In the first 
case, x may be assigned color 3 and we are done. Similarly, either 
there exists a proper coloring of G, in which yg and y4 both have 
color 2 (while yi, y3, and ys retain their original colors) or there 
exists a simple path with vertex terms 


(33.4) Y2, C1, d1, C2, do,..., Cr, dt, Ya, 


say, in G, with vertices alternately colored 2 and 4. In the first 
case, x may be assigned color 4 and we are done. 


To complete the proof, we note that both paths above cannot 
exist. If the path in (33.3) exists, then 


(33.5) iy G10) 09,0955 epg, Vea, 


is the vertex sequence of a simple closed path, which divides the 
plane into two regions. The vertex yo is in one region while y, is 
in the other. Any path from y2 to y4 in G, must necessarily have 
a vertex in common with the path in (33.4) and in particular must 
have a vertex of color 1 or 3; thus a path as in (33.5) cannot exist. 
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Y1 v4 


Figure 33.2 


In Chapter 17, we treated list-colorings of graphs to explain 
Galvin’s proof of the Dinitz conjecture. We shall now treat a result 
of C. Thomassen (1994) that provides a second proof of the five 
color theorem by first giving each vertex of the graph a list of five 
different colors, not necessarily the same list for each vertex. One 
of the ideas of the proof should be especially noted because it often 
can be used to make proofs easier. It involves making an induction 
hypothesis more stringent with the effect that the induction step 
goes more easily. 


Theorem 33.7. Let G be a planar graph with N vertices v, 
Ug,...,Un. Fori=1,...,N let S; be a set of five elements (which 
we call colors). There exists a mapping f on the vertices, with 
f (vi) € S; such that f(u;) € f(v;) for all adjacent pairs v;,v; (a five 
list-coloring of G). 


PROOF: We first make the problem harder by adding extra edges 
which change G into a triangulated polygon. Let the numbering be 
such that v1, v2,...U, are the succesive vertices of the polygon (see 
Fig. 33.3). For each of these vertices remove two colors from its list 
so that vertices on the perimeter of the polygon only have lists of 
three available colors. Again, the problem is more difficult. Finally, 
restrict matters even further by selecting two adjacent vertices on 
the perimeter and assigning to each a color from their list. We claim 
that with these initial conditions, the coloring can be completed. 
The proof is by induction on the number of vertices (small cases 
are trivial). 
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Figure 33.3 


For the induction, we distinguish two cases. In the first, we 
assume that the polygon has an edge connecting two vertices on 
the perimeter that are not adjacent on the perimeter, i.e. a chord. 
The chord divides the polygon into two parts, one of which has 
both of the precolored vertices (one possibly as an endpoint of 
the chord). The coloring of that part can be completed by the 
induction hypothesis. From this coloring, the other part inherits a 
(pre)coloring of two adjacent vertices on the perimeter and again 
induction finishes the job. 

Now assume that there is no chord. Let v; and v2 be the pre- 
colored vertices on the perimeter and let v1, u1, U2,.--, Um, Ue—1, I 
that order (clockwise, see Fig. 33.3) be the vertices adjacent to vx. 
The vertices u; are interior points and because of the triangulation, 
there are edges (v1, U1), (U1, U2),---(Um—1, Um); (Um, Ue-1)- The list 
of vz; has at least two colors not equal to the color of v;. We pick 
(these) two and remove them from the lists of u; to um and, if nec- 
essary, further shorten these lists to size 3. By induction, we can 
complete the coloring of G\{v,}. Possibly v,_1; has obtained one 
of the two colors on v;’s list but there is one left to complete the 
coloring of G. 


Notes. 

A. B. Kempe (1849-1922) took a degree in 1872 with special dis- 
tinction in mathematics at Trinity College, Cambridge where he 
also gained a musical reputation with his fine countertenor voice. 
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He chose law as his profession while not relinquishing his mathe- 
matical studies. 

The Polish mathematician Kazimierz (or Casimir) Kuratowski 
(1896-1980) taught at Warsaw University (for nearly 40 years) and 
Lwow Polytech before retiring in 1966. His celebrated theorem 
characterizing planar graphs was proved in 1930. 

Ernst Steinitz (1871-1928) was professor of mathematics at Bres- 
lau (now Wroclaw) and Kiel. He made important contributions to 
field theory (with Hilbert). 

In 1993, M. Voigt found an example of a planar graph with 238 
vertices that is not four list-colorable. 
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34 
Whitney duality 


A Jordan arc divides a small neighborhood of one of its internal 
points into two ‘halves’. Thus in addition to being incident with 
two vertices (which may coincide), an edge in a map is incident 
with two regions (which may coincide). See Fig. 34.1. 
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Figure 34.1 


This allows us to define the dual graph G* of a graph G with 
respect to a 2-cell embedding of G on a surface S. The vertices 
V(G") are to be the regions with respect to the embedding; the 
edges F:(G*) are to be the same as the edges of G; the ends of an 
edge e in G* are to be the regions with which e is incident. Two 
examples are shown in Fig. 34.2. 

We have defined G* abstractly, but it is important to note that 
it admits an embedding on the same surface S: we can represent a 
region by a point in its interior; the interior points in two regions 
sharing an edge e can be joined by a Jordan arc crossing only the 
Jordan arc representing e in G. The drawing of the dual of the 
cube in Fig. 34.2 is shown in Fig. 34.3. 

Examples will convince the reader that (G*)* is isomorphic to G. 
Two other facts that we invite the reader to think about but do 
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not attempt to prove formally are: a graph that is not connected 
cannot have a 2-cell embedding; the dual graph with respect to a 
2-cell embedding is always connected. 


ends in G | edge | ends in G* 
1,2 a LI 
1,3 b LII 
2,3 Cc III 
3,4 d int 
4,4 e€ LI 


ends in G | edge | ends in G* 
1 2 12 a LI 
I 3,4 C LIV 
VI 4,1 d LV 
5,6 e II, VI 
6,7 f III,VI 
8,5 h V,VI 
1,5 i II,V 
2,6 9 ILI 
3,7 k IIL IV 
4,8 l IV,V 


Figure 34.2 


The planar duals of the series of polygons are the bond-graphs, 
i.e. the graphs with two vertices and any positive number of edges 
joining them. In particular, the loop-graph and the link-graph are 
dual. The planar dual of a tree is a cotree, and a planar dual of a 
cotree is a tree. 
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Figure 34.3 


In general, a graph may have several planar dual graphs. Fig. 
34.4 shows two isomorphic cotrees and their nonisomorphic planar 
duals with respect to the two embeddings. Fig. 34.5 shows two 


embeddings of a nonseparable graph G and the dual graphs with 
respect to each embedding, which are not isomorphic. 


Figure 34.4 
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The goal of this chapter is to understand the relation between 
a planar graph and its dual graph(s) ‘combinatorially’. We will 
use this understanding (Whitney duality) to prove some theorems 
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about planar graphs and their duals (without any reference to pla- 
nar drawings or topology). 

Recall that the incidence matrix N = N(G) of a graph G is 
the matrix whose rows are indexed by V(G), whose columns are 
indexed by E(G), and where 


1 if x is incident with a nonloop e, 
N(a,e)= 4 2. if e is a loop incident with x 


Q if z is not incident with e. 


We are going to work over the field F, of two elements, so loops 
essentially correspond to columns of all zeros. Every other column 
contains two ones and, in particular, the sum modulo 2 of all rows 
of N is the zero vector. 

We may consider the code C(G) of a graph G, which is the binary 
code generated by the incidence matrix N(G). A graph A is a 
Whitney dual of a graph G when E(G) = E(#) and the code C(G) 
of G is equal to the dual code C(H)+ of H. Clearly, H is a Whitney 
dual of G if and only if G is a Whitney dual of H. This definition 
requires that G and H have the same edge set, which is convenient 
notationally. But, informally, it is only necessary that the edges of 
G and H be labeled by the same set, or otherwise put in one-to-one 
correspondence. 

Note that the code of a polygon with n edges consists of all even- 
weight words of length n; the code of a bond-graph with n edges 
consists of the two constant words. The code of a tree consists of 
all words; the code of a cotree consists of only the zero word. 

Some terms in graph theory are used in many contexts, with 
varying meanings. In this chapter, we use ‘cycle’, ‘circuit’, ‘cutset’, 
and ‘bond’ of a graph G to mean certain subsets, described below, 
of the edge set E(G). Other uses of these terms the reader may 
know must be temporarily forgotten. 

As in Chapter 20, we use the term support of a codeword x to 
denote the subset of the indexing set for its coordinates that are 
nonzero in x. For binary codes, we can identify the codewords with 
their supports. The supports of words in C(G) are called cutsets of 
G; supports of words in C(G)+ are called cycles of G. The terms 
circuits and bonds are used, respectively, for minimal nonempty 
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cycles and cutsets. We use cycle space and cutset space to mean 
the set of all cycles and cutsets, respectively. Of course, these are 
vector spaces over the field Fj, with addition of two sets equal to 
their symmetric difference. 

It is easy enough to understand when a subset S C E(G) is a 
cycle, or a cutset. The binary sum of the rows of the incidence ma- 
trix indexed by vertices in X has support equal to the set x (X,Y) 
of edges that ‘cross’ from X to Y := V(G) \ X. ie. that have one 
end in X and one end in Y. That is, S is a cutset if and only if 
S = x(X,Y) for some X,Y. We use this notation only when X 
and Y partition the vertex set; we allow X or Y to be empty, in 
which case S is empty. 

A vector, with support S, say, is orthogonal over F) to a row of 
N indexed by a vertex x when and only when z is incident with an 
even number of edges in S, loops being counted twice. So the cycles 
are those subsets of E(G) so that every vertex has even degree in 
G:S, the spanning subgraph of G with edge set S. 


Example 34.1. Let G be the graph below. 


1 ss : 
b 
d 
4 C 3 
The incidence matrix of G is 
abede 
1 10 0 1 41 
2 1 10 0 0 
3 011041 
A 001 1 0 


There are four cycles in G: @, {a,b,e}, {c,d,e}, and {a, b,c, d}. 
The latter three are, in fact, circuits. There are eight cutsets. The 
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set {a, b,c,d} is both a cycle and a cutset; it is the support of the 
sum of rows 2 and 4. But {a,b,c,d} is not a bond; it contains a 
smaller cutset, e.g. the support {a,b} of row 2. The set {a,b} is a 
bond. Another example of a bond is {b,d,e}; it is the support of 
the sum of rows 1 and 2, or of 3 and 4. 


Problem 34A. (i) Let G be connected. Prove one of the equiva- 
lent statements: the number of cutsets of G is exactly 2!V(@I-!. the 
dimension of the cutset space of G is |V(G)| — 1. 

(ii) Prove, in general, that the dimension of the cutset space of G 

is |V(G)| —|C(G)|, where C(G) is the set of components of G. (As 
a corollary, the dimension of the cycle space is |E(G)| — |V(G)| + 
IC(G@)|.) 
Example 34.2. The cycle spaces of Kg and of the Petersen graph 
are binary codes of length 15 of dimensions 10 and 6, respectively. 
(Fig. 35.1 in the next chapter shows that Kg is the dual of the 
Petersen graph with respect to an embedding in the real projective 
plane, but they are not Whitney duals, since the dimensions of 
their codes sum to more than the number of edges.) The minimum 
distances of these codes are 3 and 5, respectively. We remark that 
the vector of all 1s can be added to a generating set for these 
two codes and the minimum distances remain 3 and 5 while the 
dimension goes up by 1 in both cases. 


Theorem 34.1. Let G be a graph. 

(i) A set S C E(G) is a circuit if and only if it is the edge set of 
a polygon subgraph of G. 

(ii) Assume G is connected. A cutset x(X,Y) is a bond of G 
if and only if the subgraphs of G induced by X and Y are both 
connected. 


PROOF: Clearly the edge set of a polygon is a minimal nonempty 
cycle. A nonempty cycle is certainly not the edge set of a forest 
(since every tree has monovalent vertices, say) and so contains the 
edge set of a polygon; so a minimal nonempty cycle is the edge set 
of a polygon. 

Let S = x(X,Y). If the subgraphs induced by X and Y are con- 
nected, then S cannot properly contain a nonempty cutset because 
deleting any proper subset of the edges in S does not produce a 
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disconnected graph. If the subgraph induced on Y, say, is not con- 
nected, there is a partition of Y into nonempty subsets Y, and Y 
with no edges between them. Then $” := x(X UY}, Y2) is contained 
in S. Connectivity implies that there are edges between X and Yj 
and X and Y2, and this means S’ is both nonempty and a proper 
subset of S. 


The reader may check that a set of edges of an arbitrary graph G 
is a bond of G if and only if it is a bond in one of the components 
of G. Also note that a subset S of edges of a graph G is a bond 
of G if and only if the deletion of the edges in S results in a graph 
with more components than G (e.g. a disconnected graph, if G 
is connected to start with) and S is minimal with respect to this 
property. 

Given a set S of edges of a graph G, S is a circuit if and only if 
deleting the edges not in S (and any isolated vertices that result) 
produces a polygon; S' is a bond if and only if contracting the edges 
not in S (and deleting any isolated vertices that result) produces a 
bond-graph. 

Notice that the set S(a) of edges of a graph G that are incident 
with a vertex x, not including loops, is always a cutset. These 
span the cutset space—they are the supports of the rows of the 
incidence matrix. In view of Theorem 34.1(ii), S(a) is a bond for 
every « € V(G) if and only if G is 2-connected (or is a connected 
graph with two vertices). 


Problem 34B. Show that the circuits of a graph span the cycle 
space, and the bonds span the cutset space. To do this, just verify 
that the nonzero codewords in a binary linear code C, whose sup- 
ports are minimal among the supports of all nonzero codewords, 
generate the code. (This is true for linear codes over any field.) 
Hence G and H are Whitney duals when the circuits of G are 
exactly the bonds of H, or when the bonds of G are exactly the 
circuits of H. 


Example 34.3. The two graphs described by each table in Fig. 
34.2 are Whitney duals. This is because each pair consists of 
planar duals (see Theorem 34.2 below). For example, {a,e,g,c} 
and {a,j,b} are bonds in the cube G and circuits in G*; the set 


384. Whitney duality 479 


{a, b,c, d} is a circuit in G and a bond in G*. 


Problem 34C. Let G and H be Whitney duals with E(G) = 
E(H) = E. Explain why G! and H! are Whitney duals for any 
ea ee a 


Problem 34D. Let G and H be Whitney duals with E(G) = 
E(#H) = E, both of which are connected. Show that S is the edge 
set of a spanning tree in G if and only if E \ S is the edge set of a 
spanning tree in H. 


Theorem 34.2. A graph G is planar if and only if it has a Whitney 
dual. A dual G* of G with respect to any planar drawing of G is a 
Whitney dual of G. 


We prove only the second sentence (which implies the ‘only if’ 
part of the first sentence) at this time, and postpone the other part 
until the very end of the chapter. 


PRooF: Let G* be a planar dual of G with respect to some planar 
drawing of G. 

Let S be a circuit of G. So S is the edge set of a polygon subgraph 
P of G that in the drawing, by the Jordan curve theorem, divides 
the sphere or plane into two parts; let F; be the set of faces inside, 
and F» the set of faces outside, P. This is a partition of the vertices 
of G*. Clearly, each edge of P is incident with one face of F; and 
one of Fy. Any other edge of G lies (except possibly for its ends) in 
one of the parts and so is incident only with members of F; or Fo. 
In summary, S = x(F,, F2) is a cutset in G*. The circuits generate 
the cycle space of G, so we have shown that the cycle space of G is 
contained in the cutset space of G*. 

Now we note the fact that the cutset space of G* is contained in 
the cycle space of G for an embedding on any surface. There is a 
cyclic ordering to the edges incident with with a given vertex (loops 
appear twice). Partition the vertices of G* (the regions or faces)— 
say they are colored red and blue—and consider the cutset C' of G* 
consisting of edges incident with one red and one blue region. The 
number of edges of C’ incident with a vertex of G is certainly even; 
see Fig. 34.6. (This is the same as saying that if the vertices of a 
polygon are colored red and blue, then the number of edges with 
ends of different colors is even.) So C is a cycle in G. 
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Hence the cycle space of G is equal to the cutset space of G*. 


Figure 34.6 


Euler’s formula is a corollary of what we have proved so far. 
Problem 34A says the dimension of the cycle space of G is |E(G)|— 
(\V(G)| —|C(G)]), while the dimension of the cutset space of G* is 
|F'(G)| — 1 (we have remarked that G* is always connected). When 
we equate these, we obtain (33.2). 

Whitney’s Theorem allows a quick proof of the following theorem 
of S. MacLane (1937). 


Theorem 34.3. A graph G is planar if and only if the cycle space 
of G has a basis B = { By, Bo,..., By} so that each edge of G occurs 
in at most two members of B. 


PrRooF: If G is planar, the boundaries of all but one face will pro- 
vide such a basis. For if G* is the dual of G with respect to some 
embedding on the plane, then, since G* is connected, the sets of 
all nonloops incident with all but one of the vertices of G* (faces 
of G) provide a basis for the cutset space of G* (cycle space of G) 
and each edge is an element of at most two of these sets. 

Now assume the cycle space of G has a basis as in the statement 
of the theorem. Let By = Bi + B)+---+ By. Any edge of G is in 0 
or 2 of the sets B;, 0 <i <k. Define a graph H whose vertices are 
0,1,...,%, whose edge set is E(G) and where an edge e is incident 
in H with the two integers j and ¢ so that e € B; and e € By when 
e is in two of the sets B;, and where e is a loop in H (at any vertex 
i) if e is in none of the sets B;. We claim H is a Whitney dual of 
G. 

The set of nonloops of H that are incident in H with a vertex 7 is 
B;, of course, and this is a cutset in H. Since the Bjs generate the 
cycles of G, the cycle space of G is contained in the cutset space of 
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H. But we know that the cycle space of G has dimension k, while 
the cutset space of H has dimension < k, since H has k+1 vertices. 
We conclude that the cycle space of G is equal to the cutset space 
of H, i.e. they are Whitney duals. Hence G is planar by Theorem 
34.2, 


As promised, we now give some results about planar graphs and 
their duals using Whitney duality (and not referring to the plane 
or sphere at all). 


Theorem 34.4. If G and H are Whitney duals and H is con- 
nected, then G is bipartite if and only if H is Eulerian (admits an 
Eulerian circuit). 


PROOF: G is bipartite if and only if the entire edge set E(G) is 
a cutset of G. Since G* is connected, it is Eulerian if and only if 
E(G) is a cycle of G*; see Theorem 1.2. The two conditions are 
equivalent since G* is a Whitney dual of G. 


A Tait coloring (after the English mathematician P. G. Tait) of a 
trivalent graph is a coloring of the edges with three colors, say a, (3, 
and y, so that each color appears at every vertex. Fig. 34.7 shows 
such a coloring. The existence of a Tait coloring of a trivalent graph 
is equivalent to partitioning the edges into three perfect matchings 
(cf. Chapter 5) and so a bipartite trivalent graph has a Tait coloring 
by Problem 5A. 


Figure 34.7 


Theorem 34.5. If G and H are Whitney duals and G is trivalent, 
then G admits a Tait coloring if and only if H is 4-colorable. 
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PROOF: We claim that a trivalent graph G admits a Tait coloring 
if and only if there exist two cycles S; and S) in G so that E(G) = 
S;US. Given the coloring, take 5; to be the union of the edges of 
colors a and y, and S» to be the union of the edges of colors @ and 
y. These are both cycles since every vertex is incident with two 
edges of S; and two edges of Sj. Conversely, if E(G) is the union 
of two cycles 5S; and 5S», color an edge e with color 


a ife€ Si,e¢ So, 
B ife € S,,e € So, 
ey ife € Sy N So. 


Every vertex x has even degree (between 0 and 3) in the spanning 
subgraphs with edge sets S; and S29; but E(G) = S; U Sp» implies 
that both degrees are 2 and that there is one edge of each color 
incident with x. 

Our second claim is that a graph H can be properly 4-colored if 
and only if E() can be expressed as the union of two cutsets of H. 
If E(#) is the union of x (X41, X2) and x(¥1, Y2), then XN Yi, X1N 
Yo, X2M Y,, X2M Yo are the color classes of a proper 4-coloring. If 
A, B,C, D are the color classes of a proper 4-coloring, then E(#) 
is the union of x(AU B,C UD) and x(AUC,BUD). 

The two observations and the definition of Whitney duals com- 
plete the proof. 


An isthmus of a graph is an edge whose deletion increases the 
number of components (i.e. disconnects the graph if it was originally 
connected). This is the dual concept of a loop. An isthmus is an 
edge e so that the singleton set {e} is a cutset; a loop is an edge e 
so that the singleton set {e} is a cycle. (Some authors use acyclic 
edge for an isthmus since it is an edge contained in no circuits. The 
term bridge has also been used.) 


Problem 34E. Show that a trivalent graph with an isthmus does 
not have a Tait coloring. 


In view of the Four Color Theorem, every isthmus-free trivalent 
planar graph does admit a Tait coloring. (An isthmus in G would 
be a loop in the dual which could then not be properly colored.) 
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Planarity is necessary, since the Petersen graph is an example of 
an isthmus-free trivalent graph which has no Tait coloring. 

Conversely, we claim that to prove the Four Color Theorem it 
would be sufficient to establish that every isthmus-free trivalent 
planar graph does admit a Tait coloring. For suppose that the re- 
gions of every isthmus-free trivalent planar graph G can be properly 
4-colored. Then, given a planar graph G, replace each vertex x of 
degree 4 3 by the configuration indicated for the case of a vertex of 
degree 5 in Fig. 34.8. (This may be done even for vertices of degrees 
1 and 2.) This creates more regions (more vertices in the dual G*) 
but preserves adjacency of the original regions, so if the regions of 
this larger but trivalent graph can be 4-colored, then we obtain a 
coloring for the original graph. By Theorem 34.5, the regions of 
the larger but trivalent graph can be 4-colored if and only if it has 
a Tait coloring. 


Figure 34.8 


Tait conjectured in 1880 that every 3-connected planar trivalent 
graph has a Hamiltonian circuit, which would have implied that the 
graph admits a Tait coloring. W. T. Tutte (1956) found a coun- 
terexample but did prove that every 4-connected planar trivalent 
graph is Hamiltonian. 


Theorem 34.6. If G and H are Whitney duals without isolated 
vertices and G is nonseparable, then H is nonseparable. 


PROOF: Theorem 32.2 shows that any two edges of G are contained 
in a circuit of G. Hence any two edges of H are contained in a bond 
of H. But if two edges a and 0 are contained in a bond x(X,Y), we 
can find a simple path in the subgraph induced by X, joining one 
end of a and one of b, and a simple path in the subgraph induced 
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by Y joining the other ends, and we find a polygon containing a 
and b. So any two edges of H are contained in a circuit of H. This 
implies that H is nonseparable. 


Corollary. The set of edges incident with a face in a planar em- 
bedding of a nonseparable graph G, not a link-graph, is a circuit 
in G. 


PROOF: The set S of edges incident with a face is the set of edges 
incident with a vertex in G*. But G* is nonseparable by Theorem 
34.5, so S is a bond in G*. Then S is a circuit in G. 


Problem 34F. Two edges a and b of a connected graph G that 
are contained together in a circuit of G are also contained in some 
bond of G. 


Theorem 34.7. Let G and H be simple graphs without isolated 
vertices that are Whitney duals. If G is 3-connected, then H is 
3-connected. 


Proor: Assume G is 3-connected. For « € V(G), let S(x) denote 
the set of edges incident with z. Each set S(x) is a bond with 
the property that s(x) is nonseparable, and these bonds span the 
cutset space of G by definition. 

Thus in H, there exists a family F of circuits that spans the cycle 
space of H and where each member S of F¥ is a circuit with the 
property that Hg is nonseparable. 

We know H is nonseparable; suppose for contradiction that it is 
not 3-connected. Then H is the edge-disjoint union of subgraphs 
A and B of H, each with at least two edges, and where AN Bisa 
graph consisting of two isolated vertices, say x and y. If a polygon 
in H contains edges of both A and B, it must have x and y among 
its vertices. If the edges of such a polygon are contracted, we would 
obtain a separable graph since it would be the union of the graphs 
obtained from A and B by contraction of the edges of the polygon 
in each (which would identify x and y); these contracted subgraphs 
will have only this vertex resulting from the identification of the 
vertices of the polygon in common (and the reader can check that 
there still remains at least one edge in each contracted subgraph). 
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Thus the circuits in F are contained completely in either F(A) 
or E(B). Let C be any circuit of H. Since F spans the cycle space 
of H, CN E(A) and Cn E(B) are, respectively, the modulo 2 sum 
of members of ¥ that are contained in F(A) and E(B). Hence each 
of CO E(A) and CN E(B) would be cycles, and then the minimality 
of C would imply that CN E(A) or Cn E(B) is empty. It follows 
that every circuit of H is contained in either F(A) or E(B), which 
contradicts the nonseparability of H. 


Problem 34G. 

(i) Let G be a simple 3-connected graph and S a bond in G. 
Show that G’, is nonseparable if and only if S is equal to S(«), the 
set of edges incident with x, for some vertex x of G. 

(ii) Prove: If G and H are simple 3-connected graphs with equiv- 
alent codes, then G is isomorphic to H. 


Theorem 34.8. Let G be a simple 3-connected graph embedded 
on the sphere. A circuit C of G is the set of edges incident with 
some face in this embedding if and only if GG is nonseparable. 


PROOF: The circuit C' is the set of edges incident with a face if 
and only if it is the set of edges incident with a vertex in the dual 
H := G". By Problem 34G(i), this is the case if and only if H@ is 
nonseparable. This latter graph is a Whitney dual of G@. 


Thus the polygons that are boundaries of faces in an embedding 
of a simple 3-connected planar graph are uniquely determined. This 
is not true for nonseparable graphs in general—see Fig. 34.5. If we 
know the faces, i.e. the circuits that bound faces, the embedding is 
determined (this is explained somewhat in the next chapter), and 
we have the following result. 


Corollary. A simple 3-connected planar graph has a unique em- 
bedding on the sphere. 


Example 34.4. If Ks were planar, no 4-gon would be a face, since 
contracting the four edges of a 4-gon produces a graph with two 
vertices, four links, and two loops at one of the vertices, and this 
graph is not nonseparable. On the other hand, a triangle would 
necessarily be a face. 
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We can obtain yet another proof that the simple 3-connected 
graphs K;, 33, and the Petersen graph are not planar: contracting 
the edges of any 3-gon in Ks, any 4-gon in K33, and any 5-gon in 
the Petersen graph, always yields a nonseparable graph. So if these 
graphs were planar, every 3-gon, 4-gon, and 5-gon, respectively, 
would bound a face in the embedding in view of Theorem 34.8. 
But every edge belongs to three 3-gons, to four 4-gons, and to four 
5-gons in the respective graphs, a contradiction to planarity. 


E. Steinitz (1922) characterized the graphs of 3-dimensional con- 
vex polytopes. 


Theorem 34.9. A graph G is the graph of a 3-dimensional convex 
polytope if and only if G is simple, planar, and 3-connected. 


That the graph of a 3-dimensional polytope is 3-connected is a 
special case of Theorem 32.5. We do not prove the rest of the 
theorem here. 

We need a lemma for the ‘if’ part of Theorem 34.2. 


Lemma 34.10. Let G and H be graphs with the same edge set 
E.. Suppose G is the union of edge-disjoint subgraphs G'; and G2 so 
that |V(G1) NV(G2)| < 1. Let Hi and Hy» be the subgraphs of H 
that have edge sets E(G,) and E(G2), respectively, with isolated 
vertices deleted. 


(i) IfG and H are Whitney duals, then G; and H; are Whitney 
duals for both i = 1 andi = 2. 


(ii) If G; and H; are Whitney duals for both i = 1 andi = 2 and 
|\V(H1) 0 V(A2)| < 1, then G and H are Whitney duals. 


PROOF: From Theorem 34.1, it is easy to see that every bond of 
G is contained fully in either E(G,) or E(G2). Moreover, a subset 
S C E(G;) is a bond of G; if and only if it is a bond of G. 

Suppose G and H are Whitney duals. If S is a bond of G;, then 
S is a circuit of H contained in E(H;) and so is a circuit of Hj. 
And if S is a circuit of H;, then it is a circuit of H, hence a bond 
of G contained in E(G;) and so is a bond of Gj. 

Suppose that G; and H; are Whitney duals for i = 1,2 and that 
|\V(A1) NV (A2)| < 1. Let S be a bond of G. Then S is a bond of 
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G, or G2 and hence a circuit in either H; or Ha. But then certainly 
S is a circuit in H. If S is a circuit of H, then it is a circuit in 
either H, or Hy. Hence it is a bond of G; or G2 and so is a bond 
of G. 


PROOF OF THEOREM 34.2, CONTINUED: For the proof of the con- 
verse, we first establish the following assertion. Given a pair G, H 
of Whitney duals with the same edge set EF and no isolated ver- 
tices, we claim that there exists a possibly different Whitney dual 
H° which admits what we will call a system of walks for G. This is 
a family {w, : 2 € V(G)} of walks w, in H®°, one for each vertex x 
of G, such that w, traverses in H® exactly the edges incident with 
x in G, loops being traversed twice and nonloops once. We will 
use induction on the number of edges. We may assume G has no 
isolated vertices. 

First note that the assertion is easy if G is nonseparable. Of 
course, it is true for a loop-graph. Otherwise there are no loops 
and for every vertex x, the set S(x) of edges incident with x in G 
is a bond in G and hence a circuit in H. We need only let w; be a 
simple closed path in H traversing the edges of the circuit S(a) in 
Hf. 

If G is not nonseparable, there exist edge-disjoint subgraphs G, 
and Gp», each with at least one edge, and that have no vertices or 
one vertex in common, such that their union is G. With Hy, and 
Hy as in Lemma 34.10, H; is a Whitney dual of Gj, 7 = 1,2. By 
the induction hypothesis, there exist graphs H? that are Whitney 
duals of G; and that admit systems of walks fw? : x € V(G;)} 
for G;, i = 1,2. We may take H? and H? to be vertex disjoint. If 
G, and G» are disjoint, let H® be the disjoint union of H? and H9; 
then the union of the two systems of walks provides a system of 
walks for G. If G, and G2 have one vertex z in common, there are 

O ) in H® and H® that traverse the edges incident 


walks w{!) and w?? 
with z in G; and Gp, respectively. Then let H® be obtained as the 
union of H? and H$ but where we identify some vertex in the walk 
ws) in H with some vertex in the walk w?) in H$. We combine 
these two walks into one which traverses the edges incident with z 
in G; this together with the remaining walks in the systems in Hy 


and Hp provides a system of walks for G. In either case, H® is a 
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Whitney dual of G by Lemma 34.10. 

Finally, we prove the following. Let G be a graph with a Whitney 
dual H which admits a system of walks {w, : « € V(G)} for G. 
We assert that G has an embedding in the sphere so that the edges 
incident with x in G occur in the same cyclic order as they occur 
in the walk w,. We proceed by induction on the number of edges. 

If G has only loops, then H is a forest. Let e be a loop in G 
such that e is incident with a monovalent vertex in H. Let z be the 
vertex incident with e in G. The walk w, in H evidently traverses 
e twice in succession, say 


Wz = (So, €, $1, €, $2 = $9, 43, $3, A4, $4,---, Ak, So), 


where the s;s are vertices of H. The graphs G!, and H!’ are Whitney 
duals and suppressing e in w, will provide a system of walks in H/’ 
for G’,, so that by the induction hypothesis, G! admits a planar 
drawing such that the edges incident with z in G/, occur in the same 
cyclic order as they occur in the walk w, with e suppressed. In a 
small neighborhood of the point representing z in the embedding, 
we can insert the loop e to obtain the required embedding of G (see 
Fig. 34.9). 


Figure 34.9 


Now let e be a nonloop in G with ends y and z. Then G” and 
H! are Whitney duals and we claim that there is a natural system 
of walks {u, : « € V(G%)} in H! for G?. For x # y,z, we take 
Uz <= Wz. For the new vertex 29, obtained by identifying y and z, 
we combine w, and w, omitting e as follows. Say 


Wy = (So, €, $1, Aa, $2, +++, Sk—1, Ak, 80) 
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and 
Wz = (to, e, th, bo, to, tee ybm—1, bes to), 


where €,4@2,03,...,@, and e,bo,b3,...,bm are the edges incident 
with y and z, respectively, in G (loops appearing twice) and the 
sis and tjs are vertices of H. By reversing one of these walks if 
necessary, we can assume So = tp and s; = t;. Then take 


Urg += (s1, 2, 52,-++,Sk-1;, 2k; $0, bins tm-—1) Diets amie) bs, to, by, Sy). 


By the induction hypothesis, G” admits an embedding in the plane 
so that the edges incident with x in G” occur in same cyclic order 
as they occur in the walk u,. In a small neighborhood of the point 
representing x9 in the embedding, we can ‘uncontract’ e to obtain 
the required embedding of G (see Fig. 34.10). 


Figure 34.10 


Problem 34H. In a k-Tutte-connected graph (defined in Prob- 
lem 32D) with at least 2k — 2 edges, every circuit and every bond 
contains at least k edges. 


Problem 34I. A matroid consists of a set X (which we assume 
here to be finite) and a family C of nonempty subsets called circuits 
such that no circuit is a proper subset of another circuit and, if 
A,BéeC,x€ ANB, y€ A\B, then there exists C € C so that 
CCAUB,y€C,« ¢C. This latter property is called the 
‘elimination axiom’. 

(i) Prove that the edge set of a finite graph together with the 
family of all circuits (edge sets of polygons) is a matroid. 

(ii) Prove that the family of minimal nonempty supports of code 
words in a linear code over any field provides the circuits of a ma- 
troid. 
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(iii) Let (X, F) be a combinatorial geometry as defined in Chap- 
ter 23, and let C be the family of minimal dependent subsets of X. 
Show that (X,C) is a matroid. 

(To do all parts of this problem, it can save work to do part (iii) 
first and to derive the others from it.) 


Notes. 

Hassler Whitney (1907-1989), a pioneer in topology who was 
also intensely concerned with mathematical education for the last 
twenty years of his life, was a professor at Harvard and later at 
the Institute for Advanced Study in Princeton. He was awarded 
the Wolf Prize in 1982. Whitney had a degree in music from Yale, 
played the violin, viola and piano, and was concertmaster of the 
Princeton Community Orchestra. 

Matroids were introduced by Hassler Whitney in 1935 in order to 
abstract certain properties of linear independence, of circuits and 
bonds in graphs, and of duality. Their theory is ‘cryptomorphic’ to 
that of geometric lattices. See Welsh (1976) and Crapo and Rota 
(1971). 
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Embeddings of graphs 
on surfaces 


We consider embeddings of graphs on surfaces other than the sphere. 
The classification of surfaces (see Fréchet and Fan, 1967) shows that 
there are two infinite families of surfaces, up to homeomorphism. 
We use J;, g > 0, to denote the orientable surface of genus g (the 
g-torus). This can be realized as a sphere with g ‘handles’ or g 
‘holes’; Jp is the sphere. We use N,,, n > 1, to denote the nonori- 
entable surface which can be constructed by inserting n ‘cross-caps’ 
on the sphere (these cannot be embedded in R?). We state the ex- 
tension of Euler’s formula. A proof may be found in Fréchet and 
Fan (1967), or in Chartrand and Lesniak (1986) for the orientable 
case. 


Theorem 35.1. For a 2-cell embedding of a graph G on Ty, 
|F(G)| — |E(G)| + |V(G)| = 2 — 29. 

For a 2-cell embedding of a connected graph G on N,, 
|F(G)| — |E(G@)| + |V(G)| = 2-1. 


For any embedding, not necessarily a 2-cell embedding, the inequal- 
ities which result by replacing ‘=’ by ‘>’ in the equations above will 
hold. 


Example 35.1. Fig. 35.1 shows an embedding of the Petersen 
graph P on the real projective plane Nj. In this diagram, diamet- 
rically opposite points on the boundary are to be identified. There 
are six faces, each a pentagon. More precisely, the dual graph with 
respect to this embedding is the complete graph Kg. 
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Figure 35.1 


(This establishes a curious one-to-one correspondence between 
the 15 edges of P and the 15 edges of Ke. We have previously seen 
that there is a one-to-one correspondence between the 10 vertices 
of P and the 10 edges of K;.) 

(The real projective plane can be thought of as the surface of the 
sphere where diametrically opposite points have been identified. 
The Petersen graph can be thought of as the graph of a regular 
dodecahedron where diametrically opposite points have been iden- 
tified. ) 


Example 35.2. Fig. 35.2 shows an embedding of the complete 
graph K7 on the torus 7;. We use the usual representation of the 
torus; here the horizontal and vertical boundary lines are to be 
identified. There are 14 triangular faces. The dual graph with 
respect to this embedding has 14 vertices of degree 3 and seven 
hexagonal faces, any two of which are adjacent. This is the Heawood 
graph (defined in Chapter 31), which is given as an example of a 
map on the torus whose faces require seven colors. 


Figure 35.2 
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Theorem 35.2. Ifa loopless graph G admits an embedding in T,, 


g > 0, then 
7+ /1+ 489 


PROOF: Suppose G can be emdedded in 7,. We can assume that 
G is simple, connected and has at least three vertices. We claim 
that there exists a vertex x of degree at most N — 1 where N := 
(7+ /1 + 48g)/2. Once this is established, the theorem will follow 
by induction on the number of vertices; for if this vertex x is deleted 
and the remaining graph properly |N|-colored, there remains a 
color free for x. 
By Theorem 35.1, we have 


22 Bf OO 
where f := |F(G)|, e := |E(G)|, v := |V(G)|. Since every face is 
incident with at least three edges, 3f < 2e. Let d be the average 
degree of the vertices of G. We assume for contradiction that d > 


N —1. This implies that v > N. Also d > 6 since N > 7 for g > 1, 
and we have 


2 1 1 1 
LAGS Be CUS er l= dw 6d), 


12g — 12 > v(d—6) > N(N —7) = 12g — 12. 


This provides the promised contradiction. 


Here is a small table of bounds from Theorem 35.2: 


genus g|1|2|3/4|5|6 | 7/8 | 9 |10/11]12|13| 14/15 
X< |7/8]9/10]11 | 12/12/13] 13] 14/15] 15] 16] 16 | 16 


The assertion that Theorem 35.2 was the best possible result, i.e. 
that [aes | colors were required for some graph which could be 
embedded in Jj, was known as the Heawood conjecture until it was 
proved by Ringel and Youngs in 1968. See Ringel (1974). What 
they did is to prove the following stronger result. 
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Theorem 35.3 (Ringel-Youngs). Given g > 0, let 


isa 


=| ; 


Then the complete graph K,, can be embedded in Ty. 


We will prove here only a small part of this result (see Theorem 
35.4 below). For example, Theorem 35.3 shows that y(G) < 19 for 
graphs G emdedded in 759, 791, and To2. Theorem 35.4 shows that 
Ky9 can be embedded in 729 (and hence in 72; and 72), verifying 
the Heawood conjecture for g = 20, 21, 22. 

We need to understand 2-cell embeddings combinatorially, in 
terms of the graph itself. Here is what is important. 

Given a 2-cell embedding of a graph G on a surface S, we may 
traverse the boundary of any face F’ and we obtain a closed walk wr 
in G. The starting vertex is not important, nor is the direction. In 
many cases, this may be a simple closed path in G, but, e.g., when G 
is a tree embedded in the sphere, there is a single face and this walk 
passes through every edge twice (see Fig. 2.4). In general, every 
edge will either belong to exactly two walks in {wr : F € F(G)} or 
occur twice in one of these walks. A system M of closed walks in 
G with this property that every edge of G occurs twice among the 
walks in M will be called a mesh in G. 


Example 35.3. Here are two meshes in Ks. The walks are de- 
scribed by their vertex sequences: 


M, = {(1,2,3,4,5, 1), 
(1.254. in? (2530) 2)3 (3, 4, 133); (4,5, 2,4), (5, 1,3,5)}, 


Mz = {¢4; 2, 4, 5, 1), (1, 3, 2, 5, 3, 4, 2, 3, 5,4, 1); (1,4, 3, 1,5, 2, 1)}. 


The mesh M, consists of simple paths (one pentagon and five 
triangles), but the second walk in Mg traverses two edges twice. 


A triangular mesh is a mesh consisting of simple closed paths of 
length 3. So a triangular mesh for K,, is equivalent to a 2-design 
S2(2,3,n) with blocks of size 3 and index 2. But these designs 
rarely correspond to embeddings of K,, on surfaces because there 
is an important additional condition that must hold. 
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Given a mesh M in a connected graph G, we may try to construct 
a 2-cell embedding of G in a surface so that these walks bound the 
faces. There is really no choice—we must proceed as follows. For 
each walk w € M of length @, we need a corresponding closed disk 
F),, for example a convex ¢-gon and its interior in the plane (even 
for € = 1 and 2, although we cannot use straight line segments as 
the edges in these cases). Label the vertices and edges of Fi, with 
the vertices of the walk w in the order they occur in w. (So some 
vertices and perhaps edges of F,, will receive the same label if w 
is not a simple closed path.) We will ‘paste’ or ‘sew’ these disks 
together at their boundaries. 

Consider the disjoint union of all disks F,,. For each vertex x of 
G, identify all the points labeled with an x. For each edge e of G, 
identify the interior points of the edges of the disks labeled with 
e in a one-to-one continuous manner in the direction indicated by 
their ends. (It is necessary to agree that loops have two directions. ) 


1 
5 2 1 2 3 4 5 
ae vat ra a ae 
4 3 


Figure 35.3 


Figure 35.4 


Figs. 35.3 and 35.4 show, respectively, polygonal regions in the 
plane (disks) labeled by the vertices of the walks in the meshes of 
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Example 35.3. In Fig. 35.5, we have started the pasting process for 
the disks in Fig. 35.3; it remains to identify diametrically opposite 
points on the boundary (and we obtain the real projective plane 
with K; drawn on it). 


4 
Figure 35.5 


In general, the topological space constructed by this pasting has 
neighborhoods homeomorphic to the open disk (2-cell) about points 
x in the interior of the disks F), and also for points in the interior 
of an edge on the boundary of the disks (because every edge is 
in two of the walks of the mesh). But there may be trouble at 
vertices, where many of the disks F,, meet. For example, suppose 
that the walks that traverse a vertex x in a certain mesh have vertex 
sequences 

(ie dea NG re taal ee cena he de a ae oan sere 
Perea ec rents perme) ey gem Sr oc Pee excl (Maret OMe ames 

The disks corresponding to the first three walks fit together to 
make a neighborhood of x which is a 2-cell, but with the other 
three, a neighborhood of «x will look like two cones touching at 
their apexes. The graph M, defined in the next paragraph would 
consist of two disjoint triangles in this case. 

For a vertex x € V(G), consider the graph M, where V(M,) is 
the set of edges of G incident with x and where two members a, b of 
V(M,,) are adjacent in M, when they occur as consecutive edges 
of some walk in M (with the vertex x between them). Of course, 
join a and b by two edges in M,. if they occur consecutively twice, 
which probably will not often be the case. Clearly, M,. is regular 
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of degree 2. The vertex condition requires that M, is connected 
(i.e. a single polygon) for every vertex «x. 

The topological space constructed by this pasting from a mesh 
in a connected graph is a surface if and only if the mesh satisfies 
the vertex condition. There is a natural embedding of the graph 
G in this surface. The meshes in Example 35.3 satisfy the vertex 
condition. See Example 35.4 and Fig. 35.6. 


Figure 35.6 


To discover what surface S has been constructed from a mesh 
M, we first compute the Euler characteristic h := |M|—|E(G)| + 
|\V(G)|.. In view of Theorem 35.1, if h is odd, we must have 
S = No_;. But if h is even, we must decide whether the sur- 
face is orientable or not (unless h = 2, when the sphere is the only 
possibility): S = T,_p/2 if S is orientable, S = No_p otherwise. We 
state without proof that S is orientable if and only if the walks can 
be directed (one of the two possible directions being chosen for each 
walk) so that every edge is traversed exactly once in each direction, 
in which case we say the mesh is orientable. 

It is easy to check whether a mesh satisfying the vertex condition 
is orientable. We give one walk a direction and that determines the 
directions of all walks sharing an edge, which in turn determines 
the directions of further walks. The vertex condition implies that 
the direction of every walk in the mesh is determined (why?). Then 
we check each edge to see in what directions it is traversed. 


Example 35.3 (continued). The mesh M, provides an embed- 
ding of K; on the real projective plane Nj. The mesh Mg. is 
orientable—the walks may be directed as they have been written 
down—and we have a 2-cell embedding of Ks on the double torus 
Th. 
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In describing walks in simple graphs in the examples that follow, 
we list the vertex sequences in square brackets but suppress the 
terminal vertex since it is the same as the initial one. And it is 
convenient to refer to paths of length 3,4,... as triangles, quadri- 
laterals, etc., ignoring the difference between walks in a graph and 
subgraphs that are polygons. 


Example 35.4. We show that Kg can be embedded on the double 
torus 72. This shows that at least eight colors are required for 7) 
and proves the Heawood conjecture for g = 2. 

There must be 18 faces (so that f—e+v = —2), and their degrees 
must add to 56. We display a solution with two quadrilaterals and 
16 triangles as the faces. We use ‘difference methods’ (cf. Chapter 
19). 

As the vertices of Kg, we take Zg, the integers modulo 8. The 
quadrilaterals will have vertex sequences |0,2,4,6] and [1,3,5, 7]. 
The triangles are to be the translates modulo 8 of the two initial 
triangles {0,1,4] and [0,3,2]. The reader should check that every 
edge appears in exactly two of these walks. Let us check the vertex 
condition. Since translation modulo 8 preserves the polygons, we 
need only check the vertex 0. The walks traversing 0 are [0, 2, 4, 6], 
[(0, 1,4], [7,0, 3], [4,5,0], [0,3, 2], [5,0, 7], and [6,1,0]. They fit to- 
gether to form a single polygon as indicated in Fig. 35.6. 

It remains to check whether the mesh is orientable. It turns out 
that if the quadrilaterals are directed as [0,2,4,6] and [7,5,3, 1], 
and every other triangle is also reversed (i.e. whenever the initial 
triangles are translated by an odd element of Zg), we obtain an 
oriented mesh. 


Problem 35A. Find a mesh of five 4-gons in Ks, a mesh of six 
5-gons in Kg, and a mesh of seven 6-gons in K7, all of which satisfy 
the vertex condition. What surfaces have you constructed? 


Problem 35B. Find a mesh of 16 pentagons in the Clebsch graph 
(see Example 21.4). Check the vertex condition and also check for 
orientability. What surface, if any, has been constructed? What 
is the dual graph with respect to this embedding? (Cf. Problem 
35G.) 


Problem 35C. (i) Show that if a simple graph G is embedded in 
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the nonorientable surface Nj, g > 1, then 


(ii) Show that if a simple graph is embedded in the Klein bottle 
N>, then either it has a vertex of degree < 5 or it is regular of 
degree 6. Show that there is a unique mesh of 14 triangles in K7 
that satisfies the vertex condition and that it is orientable, and so 
K7 does not embed in the Klein bottle. Conclude that y(G) < 6 
for graphs G embedded in No. 

(Thus K7 does not embed in the Klein bottle after all. We con- 
clude that y(G) < 6 for graphs embedded in N53.) 


Problem 35D. Before we continue with surfaces other than the 
sphere, we remark that our understanding of meshes allows a quick 
proof of the fact that if G and H are nonseparable Whitney duals, 
then there exists an embedding of G in the sphere with respect to 
which the dual G* is isomorphic to H. As in the proof of Theorem 
34.2, the set of edges incident with a vertex of H is a circuit in 
G. The system of walks we get in G is a mesh M. Show that M 
satisfies the vertex condition. Explain why the surface constructed 
is the sphere and why the dual G* is isomorphic to H. 


Problem 35E. Consider a connected graph G and a mesh M 
in G satisfying the vertex condition (i.e. a map). To keep things 
simple, assume that G is simple and that the walks in M are simple 
closed paths—which we identify with polygons and call faces. An 
automorphism of this map is a permutation of V(G) which takes 
edges to edges and faces to faces (i.e. an automorphism of G that 
preserves M). Show that if an automorphism a fixes a vertex z, 
an edge e incident with x, and a face F' incident with e, then it is 
the identity. 

(It follows, for example, that a simple map as above with graph 
G has at most 4|E(G)| automorphisms. Examples where equality 
occurs include the Platonic solids and the maps in Examples 35.1 
and 35.2.) 


Theorem 35.8. Ifn =7 (mod 12), then the complete graph K,, 
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admits a triangular embedding on an orientable surface of genus 
g := (n—3)(n — 4)/12. 


PRooF: We construct an orientable triangular mesh in K,, satisfy- 
ing the vertex condition. Write n = 12s+7. As the vertices of K,,, 
we use Z,,. 

The construction is encoded in the diagrams of Fig. 35.7. The 
diagrams are for the cases n = 7,n = 19, n = 31, andn=12s+4+7 
in general. (In the general case, the vertical edges are directed 
alternately and are labeled 1,2,...,2s, consecutively.) We could 
describe this construction simply as an application of difference 
methods and not give any figures, but that would be a shame since 
the diagrams were conceptually very helpful in finding this and 
other embeddings. 

Note that the edges are labeled with the integers from 1 to 6s+3. 
The ‘conservation’ law, or ‘Kirchhoff’s current law’, holds: at every 
vertex, the sum of values on the incoming edges is equal to the sum 
of the values on the outgoing edges. (Except that we have no source 
or sink, this is a flow of strength zero as defined in Chapter 7.) 

Actually, it will be useful to think that the reverses of the indi- 
cated directed edges are also present; the reverse of the edge shown 
in Fig. 35.7 will carry a label equal to the negative of the shown 
edge. Then all nonzero values modulo n = 12s + 7 appear on the 
directed edges, each exactly once. The vertices are of two types: 
solid (representing ‘clockwise’) and hollow (representing ‘counter- 
clockwise’). 

Every vertex in the diagrams of Fig. 35.7 will provide a family of 
n directed triangles in K,, which are translates of one another mod- 
ulo n. For each solid vertex that is the tail of directed edges with 
values a, b, c in clockwise order, we take the n directed triangles 


[z,e+a,xrtat+b, cEZ,. 
Remember that a+b+c= 0 in Z,, so we get the same triangles 


(but with different initial vertices) if we take [y,y+b,y+64c], 
y €Z, or [z,z+¢,2+c+ al], z € Z,. For each hollow vertex that 
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is the tail of directed edges with values a, b, c in counter-clockwise 
order, we take the n directed triangles 


[z,eta,ztat+d], 2rEeZ,y. 


For example, in the third diagram in Fig. 35.7 (when n = 31), 
there is a hollow vertex with incoming edges labeled 7 and 2 and an 
outgoing edge labeled 9. We regard this as outgoing edges labeled 
—7, —2, and 9 (counter-clockwise), and take the 31 directed trian- 
gles obtained as the translates of [0,-—7,—9] modulo 31 (or trans- 
lates of [0, 9, 2] or [0, —2, 7]). Another vertex in that diagram yields 
the 31 directed triangles obtained as the translates of [0, 14, 13] 
modulo 31. 

It is not hard to see that the collection of directed triangles ob- 
tained from the diagrams form an oriented triangular mesh. For 
example, when n = 31, to find a triangle traversing the edge joining 
4 to 23 in that direction, notice that 23 —4 = —12 in Z,. The edge 
labeled —12 leaves a solid vertex with other outgoing edges labeled 
15 and —3, and the triangle [4, 4+ (—12), 4+ (—12) +15] = [4, 23, 7] 
is found. The triangle traversing the edge joining 23 to 4 in that 
order is found to be [23, 4, 6]. 


Figure 35.7 
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It will suffice to check the vertex condition only at 0. Let e; 
denote the edge {0,i} of K,. Consider the triangle of the mesh 
that traverses 0 and enters 0 on edge e,. In the diagram, the edge 
labeled a will enter a vertex that is the tail of edges with labels 
—a, p, q, say, in clockwise or counter-clockwise order, depending 
on whether the vertex is solid or hollow. That is, the triangle is 
[a,0,p| and leaves 0 on the edge e,. To summarize, if a triangle 
enters 0 on an edge e; of K,, the value j such that the triangle 
leaves 0 on e; is found from the diagram as follows: find the edge 
labeled 7 in the diagram—turn clockwise or counter-clockwise, as 
appropriate, at the head of that edge and leave on the next edge— 
the label on that edge of the diagram will be 7. 

For example, when n = 31, we find the sequence of edges e;, of 
kK, so that the triangle which enters 0 on e;, will leave on e;,,, to 
be 


€1, €_13, €-8, €_-g, €-7, €-10, €-6, €5, 


€13, €14, €12, €15, €11, €6, €E—4, €-15, €-3, €7, CE-2, €-14, 


€_1, €8, €-5, €-11, €4, €10, €3, CE-12, €2, €9, €1,.--- 


Consecutive pairs above will be adjacent in Mo, which is seen to 
be a single polygon. The reader should check that all diagrams 
in Fig. 35.7 will similarly yield under this procedure a sequence 
traversing each edge of the diagram once in each direction. This is 
quite amazing. 


Problem 35F. (i) If the complete graph K,, admits an embed- 
ding on the orientable surface J, so that all faces are bounded by 
triangles, then g = (n — 3)(n — 4)/12. 

(ii) If the complete bipartite graph K,,,, admits an embedding on 
the orientable surface J, so that all faces are bounded by quadri- 
laterals, what is g in terms of n? What is the least value of n > 2 
for which such an embedding exists? 


In order to present the proof of Theorem 35.8 as quickly and 
as simply as possible, we have taken the construction out of its 
original context. We close this chapter with a brief discussion of 
some of the ideas that allowed Ringel and Youngs to come up with 
constructions for all values of n. 
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A technique for embedding graphs on orientable surfaces was de- 
scribed by J. Edmonds (1960). It may be thought of as arising 
from the mesh in the dual graph with respect to the embedding. 
If we have a connected simple graph G embedded in an orientable 
surface S, then the choice of an orientation on S will induce at 
every vertex a cyclic permutation of the edges incident with that 
vertex. Edmonds pointed out that any such ‘local cyclic permuta- 
tions’ determine an embedding. Let G be a simple graph. Suppose 
a cyclic permutation o, of the set S(a) of edges incident with x is 
given for each vertex x. Then there is a natural oriented mesh M 
determined as follows. In the directed walk traversing an edge e 
directed from one of its ends y to the other x, the edge following e 
is to be a,(e). 

The examples below concern simple graphs, so the local permuta- 
tions may be abbreviated as permutations of the adjacent vertices 
and walks may be described by their vertex sequences. 


Example 35.5. Consider Ks with vertex set {1, 2,3, 4,5} and the 
following local permutations: 


or wd Fe 
peat: aia ns GOT i: a i Mer Ot, 
i 
— 
bo 
Or 
i i ee ti ie ed 


It might be good to think of the resulting walks as a decomposition 
of the edges of the complete directed graph on 5 vertices. Let e;; 
denote the edge directed from 7 to 7. Here are the edge terms of 
the walks: 
(€12, €24, €45, 51), 
(€13, €32, €25; €53, €34, €42, €23, €35; E54, €41), 
(€14, €43, €31, €15, 52, €21). 

The vertex sequences of these walks are exactly those of Mg. in 

Example 35.3. 


In the family of graphs with 4s + 2 vertices in Fig. 35.7, we 
specified a cyclic order at each vertex by the use of a solid vertex 
to indicate that the local permutation is to be clockwise and a 
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hollow vertex to indicate counter-clockwise. As part of the proof of 
Theorem 35.8, we had to check that the resulting embeddings have 
exactly one face. 

Given a finite group TI and a subset S' of nonidentity elements of 
I such that a € S$ implies that a7! € S, the Cayley graph G(T, $) 
is the simple graph with vertex set [ and where vertices a@ and 
@ are adjacent if and only if Ga“! € S. A complete graph K,, 
is a Cayley graph with respect to any group of order n, where S 
consists of all nonidentity elements. Often it is required that S 
generates the group—this ensures that the corresponding Cayley 
graph is connected. 

W. Gustin (1963) introduced the theory of ‘quotient manifolds’ 
and developed methods to embed Cayley graphs into orientable 
surfaces. We have no space to describe this precisely, but give 
some examples. Consider the case where the local permutations 
at the vertices of a Cayley graph are all ‘shifts’ of an initial cyclic 
permutation (s; 52 ... s,) of the set S of vertices adjacent to 0; 
that is, the local permutation at a is 


(s3 ta Sota ... spt+a) 


(with additive notation). 


Example 35.6. Consider K7 with vertex set {0,1,2,3,4,5,6} and 
with local permutation (1+73+72+76+74+%5+47) (mod 7) 


at i as below: 

0: (132645) 
(243056) 
(354160) 
(465201) 
(506312) 
(610423) 
( ) 


021534). 


Oow»rkwnrF 


The walks are: 
[1,2,4] [2,3,5] [3,4,6] [4,5,0] [5,6,1] [6,0,2] [0,1,3] 
[3,5,6] [4,6,0] [5,0,1) [6,1,2] [0,2,3] [1,3,4] [2,4,5]. 


This is the triangular embedding of K7 on the torus we have seen 
in Example 35.2. 
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If we try this method on K3), for example, with vertex set Z31 
and initial local permutation 


(13.8) 97 10 65.1314 19 15 1G, a 
S153 9 14 SR 5 hg 19 9) 


we obtain the same triangular mesh as in the proof of Theorem 
35.8. The diagrams of Fig. 35.7 actually represent the ‘quotient 
manifolds’ of the dual graphs of K,, with respect to the prescribed 
embeddings. For n = 12s + 7, the embedded graph represented 
by the diagram has 4s + 2 vertices, 6s + 3 edges, and 1 face; the 
resulting embedding of K,, has (45+ 2)n faces, (6s+3)n edges, and 
(1)n vertices. 


Problem 35G. Let w be a nonzero element in F, where gq is a 
power of a prime. Say w has order m in the multiplicative group 
and assume that —1 is a power of w. Consider the Cayley graph 
G(EF,, (w)) where (w) = {1,w,w?,...}. We are using the additive 
group of F, for the Cayley graph, so a and b are adjacent if and 
only ifa—b € (w). (This is a complete graph when w is a primitive 
element in F,; it is the Clebsch graph when g = 16 and m = 5.) 

Let us take the local permutation at 0 to be (1,w,w?,...,w’”') 
and obtain the other local permutations as shifts. What is the size 
and number of the faces in the resulting mesh? 


Notes. 

P. J. Heawood (1861-1955) spent most of his career at Durham, 
England as professor of mathematics, finally to become vice- 
chancellor. 


References. 

G. Chartrand and L. Lesniak (1986), Graphs and Digraphs, 2nd 
edn., Wadsworth. 

J. Edmonds (1960), A combinatorial representation for polyhedral 
surfaces, Notices Amer. Math. Soc. 7, 646. 

M. Fréchet and K. Fan (1967), Initiation to Combinatorial Topol- 
ogy, Prindle, Weber and Schmidt. 


506 A Course in Combinatorics 


W. Gustin (1963), Orientable embedding of Cayley graphs, Bull. 
Amer. Math. Soc. 69, 272-275. 

G. Ringel (1974), Map Color Theorem, Springer-Verlag. 

G. Ringel and J. W. T. Youngs (1968), Solution of the Heawood 
map coloring problem, Proc. Nat. Acad. Sci. U.S.A. 60, 438-445. 

A. T. White (1973), Graphs, Groups, and Surfaces, Mathematical 
Studies 8, North-Holland. 


36 


Electrical networks and 
squared squares 


We start with the matriz-tree theorem, which expresses the number 
of spanning trees in a graph as the determinant of an appropriate 
matrix. 


Theorem 36.1. The number of spanning trees in a connected 
graph G on n vertices and without loops is the determinant of 
any n—1 xX n—1 principal submatrix of the matrix D — A, where 
A is the adjacency matrix of G and D is the diagonal matrix whose 
diagonal contains the degrees of the corresponding vertices of G. 


For multigraphs G without loops we agree that the adjacency 
matrix A is defined to have A(x, y) equal to the number of edges 
joining x and y, for distinct vertices x and y. We postpone the proof 
until we have developed some tools, but we give some examples now. 


Figure 36.1 


Example 36.1. Let G be the graph in Fig. 36.1. Then 


2.; = i> 10, 
=i) -a, Sa’ eS 
OF mle we sl 
=) '=L 1) <3 
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and G has 8 spanning trees. 


Example 36.2. Take G to be the complete graph K,, in Theorem 
36.1. In this case, the matrix D — A is nI — J, where I is the 
identity matrix of order n and J is the n x n matrix of all 1s. The 
calculation of the determinant of an—1 x n—1 principal submatrix 
of this simple matrix can be done in several ways (row operations, 
consideration of eigenvalues) and is left as an exercise. Once this is 
done, we obtain yet another proof of Cayley’s theorem, cf. Chapter 
2, that the complete graph K,, has n”~? spanning trees. 


Problem 36A. Let M be ann x n matrix all of whose line-sums 
are 0. Then one of the eigenvalues of M is A; = 0; let Ag, A3,.--, An 
denote the other eigenvalues. Show that all principal n—1 x n—-1 
submatrices of M have the same determinant, and this value is 
equal to the product t2A3 see Xp. 


Example 36.3. We can use the observation of the above problem 
to calculate the number of spanning trees in regular graphs whose 
spectrum is known. For example, let A be the adjacency matrix of 
the Petersen graph. Then A has eigenvalues 


eel gl OR I eee” ei ie 
(see Chapter 21). The matrix M is 3] — A, which has eigenvalues 
0,2,2,2,2,2,5,5,5,5. 
We conclude that the Petersen graph has 2000 spanning trees. 


The following lemma is known as the Cauchy—Binet theorem. It 
is more commonly stated and applied with the diagonal matrix A 
below taken to be the identity matrix. 


Lemma 36.2. Let A and B be, respectively, r x m and m x r 
matrices. Let A be the m x m diagonal matrix with entry e; in 
row i, column i. For an r-subset S of {1,2,...,m}, let As and B® 
denote, respectively, the r x r submatrices of A and B consisting 
of the columns of A, or the rows of B, indexed by elements of S. 
Then 
det(AAB) = Nl (Az) det ( BP) ) [Te 
ieS 


36. Electrical networks and squared squares 509 
where the sum is extended over all r-subsets S of {1,2,...,m}. 


PROOF: We will prove this assuming that e),...,e,, are indepen- 
dent indeterminates. Of course, it will then hold for all values of 
€1,---,€m- 

The entries of the r x r matrix AAB are linear forms in the 
indeterminates €1,...,€m; explicitly, if A = (a;;) and B = (bj;), 
then the (i,k) entry of AAB is }""_, aijb;,e;. Thus det(AAB) is a 
homogeneous polynomial of degree r in €1,...,€m-. 

Consider a monomial ej'e? ... where the number of distinct in- 
determinates e; that occur, i.e. that have exponent t; > 0, is less 
than r. Substitute 0 for the indeterminates e; which do not occur 
in ee? .... The monomial e ey ... and its coefficient are unaf- 
fected by this substitution. But after this substitution, the rank of 
A is less than r so the polynomial det(AAB) evaluates to the zero 
polynomial. 

So we see that the coefficient of a monomial in the polynomial 
det(AAB) is 0 unless the monomial is a product of r distinct inde- 
terminates e;, i.e. unless it is of the form |],~, e; for some r-subset 
S. The coefficient of a monomial [[,.¢e; in det(AAB) can be ob- 
tained by setting e;, 7 € S, equal to 1 and e,, j ¢ S, equal to 0. 
When this substitution is made in A, AAB evaluates to AgB*. So 
the coefficient of [],<¢ e; in det(AAB) is det(Ag) det(B°). 


We will deal extensively with directed graphs H in this chapter. 
All graph-theoretic terms (trees, components, circuits, etc.) will 
be applied to a directed graph with the same meaning as for the 
underlying undirected graph we obtain by ignoring the directions 
of the edges. Thus a path in a digraph will traverse some edges in 
a ‘forward’ direction, other edges will be traversed ‘backward’. It 
is really undirected graphs we are interested in, but it is convenient 
for the theory to choose an orientation and produce a digraph; any 
orientation will do and produces essentially the same theory. 

As we have often done in previous chapters, we use the functional 
notation M(i,7) to denote the entry in row 7 and column j of a 
matrix M, and f(z) to denote the ith coordinate of a vector f. 

The incidence matrix N of a directed graph H is the matrix 
whose rows are indexed by V(H), whose columns are indexed by 
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E(#), and where 
O if x is not incident with e, or e is a loop, 
N(z,e)= 4 1 if a is the head of e, 
—1 if x is the tail of e. 


We mention that 
(36.1) rank(N) = |V(H)| — |C(A)| 


where C(#) is the set of components of H. To see this, suppose 
that g is a row vector with coordinates indexed by V(#) and that 
gN = 0. This means for every edge e, directed from x to y, say, 
that g(y) — g(x) = 0. It is clear then, that gN = 0 if and only if g 
is constant on the vertex set of every component of H, and hence 
the dimension of the space of all g such that gN =0 is |C(H)|. 

We also mention that the determinant of any square matrix that 
has at most one 1 and at most one —1 in any column, all other 
entries being 0, is equal to 0 or +1. This follows by induction: 
if every column has a +1 and a —1, then the sum of all rows is 
the zero vector, so the matrix is singular. Otherwise, expand the 
determinant by a column with one nonzero entry to find it is +1 
times the determinant of a smaller matrix with the same property. 
So every square submatrix of the incidence matrix of a digraph has 
determinant 0 or +1. (Matrices with this property are called totally 
unimodular. ) 


PROOF OF THEOREM 36.1: Let H be a connected digraph with n 
vertices, with incidence matrix N. Let S be a set of n — 1 edges 
and, with the notation of the Cauchy—Binet theorem, consider the 
n X n—1 submatrix Ng, whose columns are indexed by elements of 
S, of the incidence matrix N. By (36.1), Ns has rank n — 1 if and 
only if the spanning subgraph of H with edge set S is connected, 
i.e. if and only if S' is the edge set of a tree in H. Let N’ be obtained 
by dropping any single row from the incidence matrix N. Since the 
sum of all rows of N (or Ng) is the zero vector, the rank of Ng 
is the same as the rank of Ng. The observations of this and the 
preceding paragraph prove that 

(36.2) 


det(Ni,) = { +1 if S is the edge set of a spanning tree in H, 


0 otherwise. 
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Let a connected loopless graph G on n vertices be given, let H 
be any orientation of G, and let N be the incidence matrix of H. 
Then NN' = D— A because 


NN'(a,y) = Ss" N(z,e)N(y, €) 


ec E(G) 
_ f deg(x) ife=y, 
aa ee, if x and y are joined by t edges in G. 


An n—1 xX n—1 principal submatrix of D — A is of the form 
N'N'' where N’ is obtained from N by dropping any one row. By 
Cauchy—Binet, 


det(N'N'") = S~ det(N4) det(V4") = S> (det(N4))?, 
S S 


where the sum is extended over all (n — 1)-subsets S of the edge 
set. By (36.2), this is the number of spanning trees of G. 


Remark. If we view E(G) as a set of indeterminates and apply 
Lemma 36.2 to n —1 x n—1 principal submatrices of NAN! 
where here A is the diagonal matrix whose rows and columns are 
indexed by the edges and where the diagonal entries are the edges 
themselves (i.e. A(e,e) := e), we find that det(NAN') is the sum 
of monomials corresponding to the edge sets of the trees in G. For 
example, for the graph G in Fig. 36.1, 


a+b —b 0 —a 

To) “= Bbepek ie =e 

NE 0 —c¢ c+d —d 
—a —e -d at+d+e 


Of course, NAN' is a singular matrix (all line-sums are 0), but 
the determinant of the 3 x 3 matrix obtained by dropping, e.g., 
the last row and column is 


(a+ b\(b+c+e\(ce+d)—b(c+d)—C(a+b)= 
abc + abd + acd + ace + ade + bed + bce + bde, 
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representing the eight spanning trees in G. 


We now give a generalization of Theorem 36.1 due to W. T. 
Tutte (1948), with a proof that is shorter than the one we gave 
for the first theorem! (The generalized statement allows a type 
of induction that is not possible in the original.) The statement 
of Theorem 36.1 is recovered when we apply Theorem 36.3 to the 
directed graph obtained from an undirected graph by replacing each 
undirected edge with a pair of directed edges, one in each direction. 


Theorem 36.3. Let G be a directed graph with vertices x1,..., Xn. 
Define the matrix M, or M(G), as follows: M = (m;;) where mj; is 
the number of edges pointing away from x;, not counting loops, and 
fori A j, mi is the negative of the number of edges directed from 
x; tox;. Then the number Ay, or Ac(G), of spanning arborescences 
of G with x, as root is the minor of position (¢,0) in M, ie. the 
determinant of then—1 x n—1 principal submatrix of the matrix 
M obtained by dropping row ¢ and column ¢. 


PROOF: For notational convenience, we take @ = 1. If, for some 
i > 1, there are no edges directed away from x;, then the statement 
of the theorem holds: row i of M(G) is the zero-vector; both the 
minor of position (1,1) and the number of arborescences are zero. 

If, for some 7 > 1, there is more than one edge directed away 
from x;, let G; and G» be directed graphs equal to G except that 
some (> 1) of those edges are put into G; and the others (> 1) 
are put into Gy. Since every spanning arborescence of G with root 
x1 must contain exactly one edge directed away from x;, Ai(G) = 
A;(G,) + Ai(G2). Also note that M(G), M(G,), and M(G») are 
identical except for the ith rows, and the ith row of M(G) is the 
sum of the ith rows of M(G,) and M(G2). So the minor of position 
(1,1) in the first matrix is the sum of those minors in the latter 
two matrices. 

Hence it will be sufficient to verify the statement of the theorem 
for digraphs G such that m;; = 1 for alli > 1. That is, with the 
property that there is a unique edge directed away from any vertex 
vj,%4> 1. 

If such a digraph G has a directed closed path not passing through 
X1, SAY Lo — 13,X3 — V4,...,L_E — Xo, then the sum of rows 2 
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through k is 0 and hence the minor of position (1,1) is 0. There 
are no directed paths from any of these vertices to x1, so there are 
no spanning arborescences. 

If such a digraph G has no directed closed paths not passing 
through x, the directed path starting at any vertex x,;, and fol- 
lowing the edge directed away from each vertex that is met, must 
reach x;. We see that the digraph G, when edges leaving x, are 
deleted (these edges may not occur in any spanning arborescence), 
is itself a spanning arborescence; so A;(G) = 1. The minor of posi- 
tion (1,1) in M(G) is also seen to be 1, e.g. by induction: expand 
the determinant by a column j corresponding to a vertex x; that 
has indegree 0 in the arborescence (and so a 1 in position (j, 7) and 
Os in positions (i,7),7>1,i4 7). 


We remark that if the edges of a digraph G are viewed as in- 
dependent indeterminates and a matrix like M(G) is formed but 
where the entries are formal sums of edges rather than integers, 
then the minors are the formal sums of all arborescences, each rep- 
resented by the product of its edges. The reader may easily modify 
the above proof. For example, if G is the digraph on vertices 1, 2,3 
with edge a directed from 2 to 1, edge b directed from 3 to 1, and 
edge c directed from 3 to 2, then we consider the matrix 


0 0 0 
M=1|{-a a 0 
—b -c bee 


The minor of position (1,1) is ab + ac, which represents the two 
arborescences rooted at 1. 


Problem 36B. (Easy.) Let T), be the ‘transitive tournament’ on 
vertices 1,2,...,n with edges directed from 7 to 7 whenever 7 < j. 
What is M(T,,)? What is the number of spanning arborescences 
rooted at 1 in T},? Describe all such arborescences. 


We can now give a second proof of Theorem 8.2 on the number of 
De Bruijn sequences. Let the digraph G,, be as defined in Chapter 8. 
All the indegrees and outdegrees of G,, are equal to 2. The solution 
of Problem 2B suggested in Appendix 1 shows that the number of 
(directed) Eulerian cycles in G,, is equal to the number of spanning 
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arborescences in G,, rooted at any vertex. By Theorem 36.3 and 
Problem 36A, this number can be obtained from knowledge of the 
eigenvalues of the matrix M(G',). 

Number the rows and columns of a matrix B,, of size 2”~! by the 
integers 0 to 2"-'— 1. Let b;; = 1 if 7 = 22 or 22 +1 (mod 2”). 
(Note that the top half and bottom half of B, are the same.) Then 
M(G,) = 2I — B,. We now calculate the determinant D,,(x) of 
xzI — M(G,,). By the obvious row and column operations on the 
lower half of this determinant we find 


gn—2 


Dili) =e =2)- Dy ale). 


Since D2(x) = x(a — 2), we see that all the nonzero eigenvalues of 
M,, are equal to 2 and hence by Problem 36A, the number of De 
Bruijn cycles of length 2” is 


2-2-1 _ 92"~ 4-1 


Qn-1 


Let N be the incidence matrix of a digraph H. The rowspace 
of N is called the coboundary space of H; the nullspace of N', 
i.e. the space of all row vectors f such that fN' = 0, is called the 
cycle space of H. These are of importance in the theory of electrical 
networks and algebraic topology. From (36.1), the dimension of the 
coboundary space of a digraph H is |V(H)| — |C(#)|, and hence 
the dimension of the cycle space is |E(H)| — |V(H)| + |C(A)|. 

From the definition, a vector f is a cycle if and only if for each 
vertex x, the sum of the values f(e) on outgoing edges is equal to 
the sum of the values f(e) on incoming edges. An example of a 
cycle f (a so-called elementary cycle) is obtained from any simple 
closed path p in H by defining f(e) to be +1 on the forward edges 
of the path, —1 on the backward edges, and 0 on all edges not in 
the path. A vector is a coboundary if and only if it is orthogonal to 
all cycles, so in particular, the sum of the values of a coboundary 
on the forward edges of a simple closed path will equal the sum of 
the values on the backward edges of the path. 
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Problem 36C. A vector g with coordinates indexed by E(#) is 
a coboundary on a digraph H if and only if for every closed walk 
w, the signed sum of the values of g on the edges (the sum of the 
values of g on the forward edges of w minus the sum of the values 
of g on the backward edges of w) is zero. 


Let us agree that an electrical network consists of a digraph H, 
together with a function r associating to each edge e a ‘resistance’ 
r(e) > 0, and a function s associating to each edge e an ‘impressed 
electromotive force’ or ‘voltage source’ s(e). That is, each edge is 
to be thought of as a resistor or a battery or both. If batteries and 
resistors are connected, a ‘current flow’ f(e) will pass through each 
edge and a ‘potential difference’ or ‘voltage’ g(e) will be measurable 
across each edge. All that we need to know here is that the vectors 
f and g are determined from r and s by the rules called Kirchhoff ’s 
laws and Ohm’s law. Kirchhoff’s current law and voltage law assert, 
respectively, that f is a cycle and that g is a coboundary. Ohm’s 
law says that g(e) = —s(e) + r(e)f(e), or, in matrix notation, g = 
—s+ fR where R is the diagonal matrix with the numbers r(e) on 
the diagonal and where we think of s, g, and f as row vectors. 


Problem 36D. Suppose {e : r(e) = 0} contains no circuits, i.e. 
is the edge set of a forest. Show that the electrical network has a 
unique solution, by which we mean that for any s there exist unique 
vectors f and g satisfying the above laws. 


15 


10 


18 
14 


Figure 36.2 
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By a squared rectangle we mean a rectangle which has been par- 
titioned into a finite number of squares. Two examples appear in 
Figs. 36.2 and 36.3. These rectangles have dimensions 33 by 32, 
and 177 by 176, respectively. Squared rectangles are thought to be 
elegant when all internal squares have distinct sizes and when they 
contain no smaller squared rectangle. 


78 
99 
21 
OT 
43 
77 
9 16 
41 
34 25 
Figure 36.3 


Problem 36E. Find integral dimensions for the small squares in 
Fig. 36.4 so that the diagram represents a squared rectangle. 


Figure 36.4 
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For many years, it was not known whether a square could be par- 
titioned into a finite number of unequal smaller squares. In the late 
1930s, such squared squares were finally discovered. One example 
was shown on the cover of the Journal of Combinatorial Theory 
until a smaller example was found by A. J. W. Duijvestijn (1978); 
the cover illustration was updated in January of 1979. This is a 
partition into the least possible number (21) of squares. Comput- 
ers are used for these results, but an important tool in the study of 
squared rectangles is the connection with electrical networks which 
was discovered by four undergraduates at Cambridge in 1936-1938. 
A popular account of some of their work is given by W. T. Tutte 
(1961) and (1965) who was one of them. 

We give an informal description of this connection with electrical 
networks. Let us use the term special network for an electrical 
network with a distinguished edge ep so that s(e) = 0 for e F e, 


and where 
(6) 1 ife#e, 
r(e) = 
0 ife= €0- 


That is, we have a system of unit resistors and a single battery. 


25 
36 
7) : 16 
ac <4 61 
oo 
33 be 
Figure 36.5 


A special electrical network arises from a squared rectangle as 
follows. Define a digraph H whose vertices are the maximal hor- 
izontal line segments in the diagram. The edges of H are to be 
the small squares, and we include one special edge eg for the entire 
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rectangle. We direct each edge (except e 9) downward, i.e. towards 
the lower line segment. An example is shown in Fig. 36.5. The 
digraph H is clearly planar. There is a real number f(e) naturally 
associated to edge e, namely the size of the corresponding small 
square; we take f(e9) to be the width of the large rectangle. It is 
not hard to see that f is a cycle on H: for each maximal horizontal 
line segment, the sum of the sizes of the squares ‘sitting on’ that 
segment is equal to the sum of the squares ‘hanging from’ it. More- 
over, were it not for the special edge, f would also be a coboundary: 
if for each vertex (horizontal line segment) x, we take h(x) to be 
the distance of the line segment from the top side of the large rect- 
angle, then for e 4 €9, f(e) is equal to h(head of e) — h(tail of e). 
So if we let s(eo) be the height of the rectangle and take g to be 
the coboundary of h, then f and g are the current flow and voltage 
vectors which solve this special network, i.e. g = —s+ Rf. 

Conversely, a special electrical network where the digraph is pla- 
nar leads to the construction of a squared rectangle. We do not 
give a proof. The ratio of its width to its height will be the ratio of 
the current flow f(e9) to the voltage —g(e9) in the solution to the 
network. (Some of the edges may have negative current flow when 
the network is solved, but we could reverse the direction on these 
edges to get a positive value. The edges with zero current flow may 
be deleted—or contracted; they will not give rise to squares.) Thus 
a squared square corresponds to a planar network of resistors which 
have the net effect of a resistance of one ohm. 


Problem 36F. Solve the special networks in Fig. 36.6 below by 
any means you wish, and sketch the corresponding squared rectan- 
gles. The dual graphs may also be thought of as special networks— 
what are the squared rectangles that arise from the dual networks? 


Figure 36.6 
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Let N be the incidence matrix of a digraph D and A a diagonal 
matrix indexed by E(D) with A(e,e) = r(e) > 0, the resistance of 
the edge e; let C := NA !N'. If an additional edge eg is directed 
from one vertex y to another vertex x of D, and a sufficient voltage 
source is impressed on that single edge so that a current f(eg) flows 
through it, then Kirchhoff’s laws imply that the potential difference 
g(e) across an edge e joining vertex a to vertex b, say, is given by 


(36.3) g(e) = 


where (xy.ab) denotes the cofactor of the element c,» in the cofactor 
of the element Crq of C = (c;;) and 7(D) denotes the number of 
spanning trees of D (the complexity of D). See Jeans (1908). 

A consequence of this is the following theorem. 


Theorem 36.4. If the voltage source s(e) on the distinguished 
edge ey of a special network with graph G is taken to be the number 
of spanning trees on the edge €o, then all values f(a) of the resulting 
current flow will be integral, and the current flow f(e 9) through the 
distinguished edge will be equal to the number of spanning trees 
off the edge eo. 


PROOF: In the discussion preceding the statement of the theorem, 
take D to be an orientation of G., and A the identity matrix. If 
the current flow f(eo) is T(D) = 7(G{,), then it is clear from (36.3) 
that all values of f(a) = g(a) are integral. It remains only to show 
that —g(eo) is equal to the number of spanning trees on the edge 
€0- 

By (36.3), g(eo) = (ry.yx) = —(ay.xy). By definition, (ry.ry) 
is the determinant of the matrix N’N”' where N” consists of N 
with rows x and y deleted. By the Cauchy—Binet theorem, this is 
equal to 

S| (det(Ng))” 


S 


where the sum is extended over all (n — 2)-subsets S of the edge 
set and N¢ is the square submatrix of N” whose rows are indexed 
by elements of S. 
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The proof is completed by the observation that det(VZ) is 0 
unless S U {eo} is the edge set of a spanning tree in G, in which 
case it is +1. This is similar to the proof of (36.2) and is left as an 
exercise. 


In summary, Theorem 36.4 says that the resistance between the 
ends of an edge €9 of a graph G when all other edges are thought of 
as unit resistors, is the ratio T(GZ,)/7(Gi,). Readers might check 
that this agrees with their knowledge of electrical networks when 
G is a polygon or a bond-graph. A squared square arises when we 
find a planar graph G and an edge eo so that T(G¢,) = T(G.,). This 
statement alone does not necessarily help in the search for squared 
squares (and electrical engineers will think this would be a waste 
of resistors anyway). Such graphs appear to be quite rare. 


Notes. 

The first appearance in the literature of the problem of using in- 
congruent squares to make a rectangle seems to be M. Dehn (1903). 

The first extensive tables of squared squares were published by 
C. J. Bouwkamp, A. J. W. Duijvestijn, and P. Medema (1960). 

W. T. Tutte (1918—) has made numerous significant contributions 
to graph theory. He is Emeritus Professor of Mathematics in the 
Department of Combinatorics and Optimization of the University 
of Waterloo, one of the few academic departments in the world 
whose name includes the word ‘combinatorics’. 

For the proof of Theorem 36.3, we have followed De Bruijn and 
Van Aardenne-Ehrenfest (1951). 

One of the Cambridge students mentioned earlier, C. A. B. Smith, 
wrote a very nice account of the history of their collaboration, called 
‘Did Erdés save Western Civilization?’ (see Graham and Nesetiil 
(1997)). Apparently, Erdés conjectured that any dissection of a 
square into smaller squares must contain at least two squares of 
equal size. One thing led to another and Tutte ended up at Bletch- 
ley Park instead of in the army. It is rumored that there he supplied 
the vital clue to the deciphering of the Enigma, which explains the 
title of the paper. 
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ov 
Polya theory of counting 


We return to counting in this chapter. There are many instances 
when we are not interested in the number of primitive objects, but 
rather the number of equivalence classes of objects with respect to 
an appropriate equivalence relation. Moreover, these equivalence 
relations are often induced by certain permutation groups in a nat- 
ural way. 


Question 1. What is the number of ‘essentially different’ neck- 
laces which can be made with n beads of two different colors? For 
n = 6, this number is, by inspection, 13. 


eh EP E> ED Ee ED Ke 
oS Oh Ge SS Se Se 
Note that we choose to regard the necklaces 
(37.1) $B and g% 


as essentially the same, since one can be reflected through a ver- 
tical axis (or flipped over) to get the other. Evidently, underlying 
our concept of ‘essentially different’ is the dihedral group D,, of 
automorphisms of the n-gon. 


Question 2. What is the number of nonisomorphic simple graphs 
on n vertices? For n = 4, there are 11. 


Question 3. What is the number of essentially different ways to 
paint the faces (or edges, or vertices) of the cube with n colors? 
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Question 4. Given three white balls and one black ball, what is 
the number of ways to distribute them into two square boxes and 
one round box? 


fooce] [] () [occ] [e] () [00] [oe] () [oce] [0] () 
[000] [] @) [00] [c] (e) fooe] [] ©) [00] [e] ©) 

[oe] [o] ©) [oe] [] ©@o) [0] [e] (©0) [00] [] ©e) 
[0] [ce] @¢@) [e] [] @00) [0] [] @ce) [] [] @ooe) 


Let A and B be finite sets and G a group of permutations of A 
(or more generally, a finite group acting on A). The elements of B 
will be called colors. The group G acts on the set B4 of mappings 
f : A— B when for o € G and f € BA, we define o(f) € B4 by 


(o(f))() = f(o"'(2)). 


It is the orbits of an appropriate G on the set B4 that we wish to 
count in the first three questions above. 

(The use of o~' rather than o on the right-hand side above is 
not an error. It is required to ensure that o(7(f)) = (o7)(f), 
which is necessary in order to have a legitimate ‘action’ of G on 
BA. Formally, we have a homomorphism of G into the symmetric 
group on B4.) 

In the necklace problem, we would take A to be the vertex set 
of an n-gon, and G as its automorphism group represented as a 
permutation group on the vertices (i.e. the dihedral group D,, of 
order 2n in its usual representation). A slightly different problem 
arises when the beads to be used are not round, but, say, flat on 
one side. Then the necklaces cannot be flipped over and those in 
(37.1) are regarded as distinct. 

Here, we would take G to be the cyclic group of order n in its 
regular representation; cf. Example 10.5. 

Asking for the number of orbits of mappings is the simplest ques- 
tion we can pose. We might further want to know the number of 
orbits of a given ‘weight’; for example, the number of necklaces of 
n beads, k of which are to be white, or the number of graphs on 
n vertices with k edges. We might introduce a permutation group 
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Hon B and, with respect to a more general equivalence relation, 
ask for the number of ‘configurations’. (This would be necessary 
for the distribution problem.) 

To begin, we recall Burnside’s lemma, Theorem 10.5: the number 
of orbits of a finite group G on a set X is the average number of 
fixed points: 


1 
(37.2) iq So v0), 


aéG 


where 7(a) denotes the number of points of S that are fixed by the 
permutation a. Recall that this formula followed from counting 
ordered pairs («,a) € X x G such that o(x) = 2. 


Theorem 37.1. Let A and B be finite sets and let G act on A. 
Denote by c,(G) the number of permutations in G that have exactly 
k cycles in their cycle decomposition on A. Then the number of 
orbits of G on the set B4 of all mappings f : A > B is 


1 CO 
ig 2 (BI. 
k=1 


PRooF: By Burnside’s lemma, the number of orbits is given by 
(37.2) where ~(c) is here the number of mappings f : A — B such 
that o(f) = f, ie. f(a) = f(a \(a)) for all a € A. But a mapping 
f is fixed by o if and only if f is constant on every cycle of o; the 
reader should check this. Such mappings are obtained by assigning 
an element of B to each cycle of o, and thus if o has k cycles, the 
number of mappings f fixed by a is |B|*. 


We encourage the reader to pause at this point and to verify that 
Theorem 37.1 produces the answer 13 when applied to the necklace 
counting problem raised as Question 1 above. 

Since it is to be used extensively later, it is appropriate to intro- 
duce the cycle index of a permutation group at this point. First, 
in this chapter, it will be convenient to use the venerable notation 
(12'2 ...n*~) for the partition of the integer n which has k; parts 
of size 7, 7 = 1,2,...,n. Do not compute powers or multiply; this 
is just notation. 
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For a permutation o of a set A, let z;(o7) denote the number 
of cycles of o having length i; so (1717227) ...) is a partition of 
n = |A], called the type of o. Given a group G acting on A, we 
define the cycle index Zg as a polynomial in n letters X,, X9,..., Xn 
by 


ZG(X1, X2,..-,Xn) = a 11 Xenl0), 
o€G 


The assertion of Theorem 37.1 is that the number of orbits of G 
on B4 is 


Za(b,b,...,b = que ee ae 


o€G 
where b := |B|. 


Example 37.1. A cyclic group C,, of order n has y(d) elements 
of order d for each divisor d of n. As a permutation in the regular 
representation of C’,, an element of order d has n/d cycles of length 
d. Thus, for the regular representation of C,,, we have 


i mr 
Ze,(Xay +++ Xn) = = S- p(ayxy. 
d\n 


We can now count the number of ‘one-sided’ necklaces of n beads 
of two colors. The answer is Zc, (2,2,...,2) = F Dod|n 0 (a) 2"/4; of. 
equation (10.12). For n = 6, we get 14, one more than with dihedral 
equivalence (the two necklaces in (37.1) are now distinguished). For 
n = 10, we get 108. 


Example 37.2. To find the cycle index of the dihedral group D,, 
in its usual representation on n points, we must account for the 
permutations (reflections) outside of its cyclic subgroup of order n. 
There are two cases depending on the parity of n: 


on oa y(d) x7 oF nXiXy7 ) for n odd, 


Zp,, = 
n/d n $- 
In (San y(d)X7" Ae gATXS 


n 


1 n 
+ 5X ?) for m even. 
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The number of ‘two-sided’ necklaces with n beads of two different 
colors is Zp, (2,2,...,2). For n = 6, we get 13 (verifying the list 
following Question 1). For n = 10, we get 78. 


Problem 37A. Compute the cycle index of the group of rotations 
of the cube represented as permutations of the six faces. (There 
are 24 rotations—including the identity—that map the cube, as a 
rigid body in 3-space, onto itself. We are ignoring the 24 reflections 
of the cube.) What is the number of essentially different ways to 
paint the faces of the cube in 2 colors? In 3 colors? In n colors? 


Before going further, let us write down the cycle index of the 
symmetric groups. We noted the formula for the number of parti- 
tions of a given type (1"12'2...n*») in equation (13.3). Each block 
of size i may be equipped with a cyclic permutation in (¢—1)! ways, 
so we have 


a2 1 ky yke kn 
28, = 1s THQ" | nPaky lial. Bl! ee an 
(1*12*2...) 


We list the first several polynomials below: 


WZs, = X1, 
WZs, = X? + Xo, 
3!Z5, = X? + 3X1 Xo + 2X3, 
AIZo, = XP + 6X7 Xo + 3X5 + 8X1 X34 6X4, 
5!Zs. = XP + 10X}Xo + 15 XLXF + 20X7X3 + 20X2X3 
+ 30.X,X,+ 24X5, 
6!Z5, = XP + 15X}X2 + 45X72 Xs + 40X23 + 40X3X3 + 15.X3 
+ 120X,X_X3 + 90X?Xy + 9OX2Xy + 144K, X5 + 120X 5. 


Example 37.3. Theorem 37.1 can be used for the enumeration of 
graphs. Let V be a fixed set of n vertices and let E consist of all 
2-subsets of V. Then the simple graphs with vertex set V can be 
viewed as mappings f : E — {0,1}, the graph corresponding to f 
having edge set f~'(1) = fe: f(e) =1}. 
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Two graphs on V are isomorphic if and only if there exists a 
permutation of V taking the edges of one onto the edges of the 
other. To describe this in a manner suitable for the application 
of Theorem 37.1, let S,, be the symmetric group on V and let 
s?) = {o) :¢ € S,} be the induced group, isomorphic to S;,, of 
permutations of FE where 


0) : {x,y} = {o(z),o(y)}- 


Then S$ acts on {0,1} and we note that f,g : E — {0,1} corre- 
spond to isomorphic graphs if and only if they lie in the same orbit 
of S$?) on {0,1}. 

It is not a pleasant task to compute the types of all permutations 
in S?). For example, if ¢ € S; has type (2!3), then it turns out 
that 0) has type (1!3'6!) on the 10 edges of Ks. The cycle index 
of So) is 


1 
aa 10 4 10XPX3 + 15X27 Xd + 20X X38 
+ 20X,X3X6 + 30X_X? + 24X2). 


In general, we will have a sum over all partitions of n. The number 
of nonisomorphic graphs on five vertices is found, by substituting 
2 for all indeterminates, to be 34. 


We now generalize Theorem 37.1 to allow for weights. Let A be 
an n-set, let G act on A, and let B be a finite set of colors. Let R 
be a commutative ring containing the rationals and let w: B— R 
assign a weight w(b) € R to each color b € B. For f : A > B, 


define 
W(f):= ][ w(f@) € BR; 


acA 


here W(f) is the weight of the function f. Note that two mappings 
representing the same orbit of G on B4 have the same weight, i.e. 
W (o(f)) = W(f) for every o € G. The sum 


wif), 


fer 
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extended over a system of representatives R for the orbits, is called 
the configuration counting series. (The orbits are often called con- 
figurations; the term series becomes appropriate when the weights 
are monomials.) If all weights w(b) are 1, then the following theo- 
rem reduces to Theorem 37.1. 


Theorem 37.2. With the terminology as above, the configuration 
counting series is given by 


dW(f) = Ze (Soe. Deer vey >on 


bEB beB beEB 


Proor: Let N = 5>W(f), the sum extended over all pairs (a, f) 
with o € G, f € BA, and o(f) = f. We have 


N= N° W(f)IG¢l, 


feBA 


where G'y is the stabilizer of f. Consider the terms W(f)|Gy| as f 
ranges over an orbit O of G on BA: each is equal to W(fo)|G|/|O| 
where fo is a representative of the orbit O, so these terms sum to 
IG|W (fo). It is now clear that N is |G| times the configuration 
counting series. 


On the other hand, 


Ve VG) 


ce \o(f)=f 


Recalling the definition of Zg, we see that the proof will be com- 
plete if we show that 


w= (mw) (Sjwor*) (stm) : 


o(f)=f beB beB beB 


whenever o is a permutation of type (1'12'2 --.nkn), 
A mapping f : A — B is fixed by o if and only if f is constant 
on every cycle of o on A. Let 


C1, Co,...,Ce, ki= ky thot+---+ky 
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be the cycles of o. The mappings f € B4 fixed by o are in one-to- 
one correspondence with k-tuples (bj), bo,...,b,%) of elements of B 
(the corresponding mapping is the one associating b; to all elements 
of C;). The weight of the mapping f corresponding to (b1,..., bx) 
is 


k 
WA) = [Trot 


S> WP) = SD (wba)! fur by) ]!2! - = fev (b ge! 
o(fy=f bi ,b2,...,bk 


= Dimon) (Stent ote 


as required. 


Example 37.4. Consider the necklace problem with cyclic equiv- 
alence. Here G = C,,, B = {black, white}. We take R to be the 
polynomial ring Q[X] and define w(black) = 1, w(white) = X. 
Then the weight of a coloring f : A — B is 


W(f) = x4 


where k is the number of white beads. 

Thus the number of essentially different cyclic necklaces with n 
beads, k of which are white, is the coefficient of X* in the configu- 
ration counting series 


se oa Pe Pa nel eae 


= Ss: ~~ od) pee x4 n/d 
ae 
n/d 
2 Yr v(a s ee 


d\n r=0 
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This coefficient is 


7 (jy): 


d|(k,n) 


while for n = 12, k = 4, there are 4 (e(1) ('7)+¢(2) (3) +¥(4) (')) 
43 necklaces. 


So if k and n are relatively prime, the number is simply +4 tap 


Problem 37B. What is the number of essentially different ways 
to paint the faces of a cube such that one face is red, two are blue, 
and the remaining three are green? Do this ‘by hand’ (which may 
be quicker) as well as by using Theorem 37.2, and compare your 
answers. 


Let A and B be finite sets, |A] = n, and let G and H be finite 
groups, G acting on A and H on B. The direct product G x H acts 
on B4 when for f € B4 and (c,7) € G x H we define (a,7)(f) by 


((o,7)(f))(a) = r(f (a (a). 
Theorem 37.3. The number of orbits of G x H on B®“ is 


rH >_ Za(ma (7), m2(r),.--,™Mn(7)) 
TECH 


where 
MG) ters WS ge 
ili 


PROOF: By Burnside’s lemma, the number of orbits is 


1 
oq M7) 


o€G,rTEeH 


where ~(o,7) is the number of mappings f € B4 with (o,7)(f) = f. 
To complete the proof, it will suffice to show that for each 7 € H, 


(37.3) ave. aT. 
o€G 
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Fix o € G and 7 € H, and let 
C1, Co,...,Ce, k= 21(0) + 22(0) +--+ + 2n(o) 


be the cycles of g on A. A mapping f : A > B is fixed by (0,7) if 
and only if all the restrictions f; := f|C; are fixed, 1 <i<k. So 
W(o,T) is the product, over i in the range 1 < i < k, of the numbers 
of mappings f; : C; — B satisfying 


fi(o(a)) =T(fila)) for all a € C;. 


Let C; have length @ and fix a, € C;. Suppose that fj; :C; —~ B 
is fixed by (o|C;,7) and that f;(a,) = b. Then f; is completely de- 
termined: f;(a'(a,)) = 7'(b). Moreover, b = f(a.) = fi(o'(ao)) = 
r“(b) and so we see that the cycle of r containing b must have length 
’, a divisor of @. 

Conversely, if b is an element of B lying in a cycle of 7 having 
a length dividing ¢ := |C;|, then we can define a mapping f; on C; 
by fi(o'(a,)) := 7'(b) (check that f; is well defined) and this f; is 
fixed by (a|C;j,7). 

In summary, when |C;| = @, the fixed mappings f; : C; — B are 
equinumerous with the elements of B lying in cycles of 7 of length 
dividing @. The number of such elements is, of course, 


mT) = S 52%5(7). 


jlé 


from which the desired equation (37.3) is immediate. 


Example 37.5. What is the number of ways to distribute 2 red, 
2 yellow, and 4 green balls into 1 round and 3 square boxes? We 
take 


A= {R,, Ro, Vi, Yo, Gi, Go, G3, Ga}, 
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G = S» X So X Sa, B = {r, 81, 89, 83}, H=S, x S3. 


It is easy to see that the cycle index of G is the product of the 
cycle indices of the symmetric groups of which it is the product, so 


(37.4) Ze =Zs,- Zs, + Zs, 
1 


= Sppigy (Xt + X2)°(X + 6XPXo + 3X} + 8X1Xs + 6X,). 


There are three types of permutations in H. If 7 is the identity, 
then 


Mmitj=HA, Saelr)=4) omer) H4, tar) aH + 
if 7 transposes two of {s2, 52, 53}, then 


WT) = 2;. “Holt =4, malt) S2; salt} = 


By Theorem 37.3, the number of distributions is 


is <1 
3! 2I014! 


[(4? +.4)?(444+6-4?-442-47+8-4-44+6-4) 

+ 3(2? +.4)?(2° 4+ 6-2? -4+3-4°+8-2-246-4) 
+2(1?+1)7(1°+6-17-143-17+8-1-44+6-1) 
= 656 (if we did the arithmetic correctly). 


Problem 37C. Given G on A, H on B, find an expression for the 
number of orbits of injective mappings f : A —> B. 


We give, without proof, a statement of the extension of Theorem 
37.3 to include weights. See De Bruijn (1964) for a proof. If all 
weights are 1, then this reduces to Theorem 37.3. 


Theorem 37.4. Let G act on A, H on B, let R be a commutative 
ring and w: B — R. Assume that w is constant on each orbit of 
H on B. With W(f) := [],c4wlf(a)|, the sum }7 W(f), extended 
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over a system of representation for the orbits of G x H on BA, is 
equal to 
1 
— y ZG (M VE ninaclvhe 


where 


ri (b)=b 


Example 37.6. We continue Example 13.5. Take 
w(r)=r, w(s1) = w(s2) = w(s3) = s, 


where R = QJr, s|. The number of distributions with t balls in the 
round box and the remaining 8 — t in square boxes is the coefficient 
of r's8—* in 


1 
7 (Zo(r + 38,77 + 387,73 + 38%, r4 + 3s") 
+ 3Z¢(r+s,r?+3s’, 73453, r1+3s4)42Z¢(r, 134383, r4)) 


where Zg is the polynomial in (37.4). 


Problem 37D. Identify necklaces of two types of colored beads 
with their duals obtained by switching the colors of the beads. (We 
can now distinguish between the two colors, but we can’t tell which 
is which.) Now how many of these reduced configurations are there? 
(For example, with n = 6 and dihedral equivalence, there are 8 
distinct configurations.) How many necklaces are self-dual? 


Problem 37E. The faces of the cube are to be colored with s col- 
ors, and the vertices with t colors (which can be assumed different, 
though it is not important). In how many essentially different ways 
(with respect to the rotation group of the cube) can this be done? 


The theorems of this chapter are also sources of algebraic iden- 
tities. 
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Consider the case of ‘distribution of like objects into unlike cells’. 
(We are recalling something which was discussed in Chapter 13.) 
For example, in how many ways can we distribute 23 apples to 
Fred, Jane, and George? More generally, consider the distributions 
of a set A of n apples to a set B of x individuals (the elements 
of A are ‘like’, whereas the elements of B remain distinct, i.e. are 
‘unlike’). Such a distribution amounts to the selection of a family 
(k, : b € B) of nonnegative integers ky with S°,.p ky =n, ky apples 
going to individual b. 

Thus the number of distributions, as shown in Theorem 13.3, is 


WP eal 
S: Z 
However, the best formal definition of such a distribution is as 
an orbit of mappings f : A — B with respect to the symmetric 
group S, acting on A. (Two mappings f,g € B4 determine the 


same distribution if and only if they ‘differ’ by a permutation of 
the apples.) By Theorem 37.1, the number of distributions is 


1 n 
al S° cr(Sn) x, 
* k=0 


where c;(S,,) is the number of permutations of n letters with ex- 
actly k cycles. In Chapter 13, the numbers c;(S;,) were called the 
signless Stirling numbers of the first kind, and denoted by c(n, k). 
By Theorem 37.2, for every nonnegative integer 2, 


(x +n — 1)(n) = S° Clon z' 
k=0 


and hence this must hold as a polynomial identity. We have there- 
fore re-proved the formula (13.7). 

Take G to be S,, on an n-set A, let B be finite and let w be 
the insertion of B into the polynomial ring Q|B]. Here the weight 
of a mapping f : A — B is a monomial and two mappings are 
equivalent, that is, represent the same configuration, if and only if 
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they have the same weight. The configuration counting series 


(37.5) S" Wf) = Zs, (= Bsoy 0 tae >") 

beB bEB beB 
expresses the so-called homogeneous-product-sum symmetric func- 
tions as a polynomial in the power-sum symmetric functions. For 


example, with n = 3 and B= {X,Y}, 
6(X° + X°Y + XV? +Y%)= 
COL YA BOS VCC ay) Sosa yy), 
With n = 4 and B= {X,Y, Z}, (37.5) expresses 


S- xtyizk 


itjtk=4 
as a polynomial in 


Rey Gh Xa yA ae EV ee. a XO eee, 


Notes. 
We remark that Zs, (X1,..., Xn) is the coefficient of Y” in 


exp (xy + oe Ze aXe? a +) 
as an element of (Q[.X1, Xo,...])[[Y]]. 

The answers provided by Polya theory have the advantage (over 
inclusion-exclusion, say) that the formula produced is the sum of 
positive terms rather than an alternating sum. 

G. Pélya (1887-1985) was a Hungarian mathematician whose 
book Problems and Theorems in Analysis written in 1924 with G. 
Szego is still a classic. But he is perhaps best known for How to 
Solve It, which has sold more than one million copies. He wrote 
papers on number theory, complex analysis, combinatorics, proba- 
bility theory, geometry and mathematical physics. 
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Baranyai’s theorem 


In this chapter we shall give an elegant application of the integrality 
theorem on flows to a problem in ‘combinatorial design’. 


Example 38.1. Suppose we have been entrusted to draw up a 
schedule for the ‘Big Ten’ football teams. Each weekend they are 
to divide into 5 pairs and play. At the end of 9 weeks, we want 
every possible pair of teams to have played exactly once. 

Here is one solution. Put 9 teams on the vertices of a regular 
9-gon and one in the center. Start with the pairing indicated below 
and obtain the others by rotating the figure by multiples of 27/9. 


Explicitly, here is the schedule: 


re-e() reel re-02 re-03 reo4 ree} ree6 ree7 re08 
le08 20e-0() deel 4e02 5e-03 6e04 7e05 8e-06 Oeo7 
2e07 3e-08 dee) 5e-ol 6e-02 7e03 8e-04 Oe05 leeG6 
3e-e6 4e_e7 5e-08 60-00) Te-ol 8e-02 Oe-03 le04 Qe05 


deo} 5ee6 6e-07 T7e08 8e-00 eel le02 20-03 3e-04 


There are many other ways to do this. For example, start with the 
initial pairing shown below. 
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A perfect matching in a graph is also called a 1-factor. A parti- 
tion of the edge set of a graph into 1-factors is a 1-factorization. In 
the above example, we constructed 1-factorizations of Ky). There 
are 396 nonisomorphic 1-factorizations of Ky9, a computer result of 
E. N. Gelling (1973). See also Mendelsohn and Rosa (1985). 


Example 38.2. Let I be any abelian group of odd order. Consider 
the complete graph on the vertex set [.U {oo}. For g € I, let 


My = {{g, o}} U {{a, b} :a+b = 2g, a Fd}. 
Then {M,: 9 €T} is a 1-factorization of the complete graph. 


We consider the following generalization of this problem: let us 
use the term parallel class of k-subsets of an n-set to mean a set 
of n/k k-subsets which partition the n-set. Can the set of all k- 
subsets be partitioned into parallel classes of k-subsets? Of course, 
this necessitates that k divides n. The number of parallel classes 
required would be E(?) = Ca: 

This was not hard for k = 2. It is much harder for k = 3, 
but was done by R. Peltesohn in 1936, and for k = 4 by J.-C. 
Bermond (unpublished). The reader might try to find appropriate 
parallel classes for n = 9, k = 3. It is certainly not clear that our 
generalization always admits a solution and so it was surprising 
when Zs. Baranyai proved Theorem 38.1 in 1973. All known proofs 
use a form or consequence of Theorem 7.2 or 7.4. The proof we 
give is due to A. E. Brouwer and A. Schrijver (1979). 


Theorem 38.1. If k divides n, the set of all (7) k-subsets of 

an n-set may be partitioned into disjoint parallel classes A;, i = 
n—-1 

a ray aed 

PROOF: In this proof, we will use the term m-partition of a set X 

for a multiset A of m pairwise disjoint subsets of X, some of which 
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may be empty, whose union is X. (The normal use of ‘partition’ 
forbids the empty set, but it is important here to allow it to occur, 
perhaps with a multiplicity, so that the total number of subsets is 
m.) 

In order to get an inductive proof to work, we prove something 
seemingly stronger than the original statement. Let n and k be 
given, assume k divides n, and let m := n/k, M := aa We 
assert that for any integer ¢, 0 < @ <n, there exists a set 


Aj, Ao,..., Au 


of m-partitions of {1,2,...,¢} with the property that each subset 
S C {1,2,...,€} occurs in exactly 


(38.1) ( a) 


of the m-partitions A;. (The binomial coefficient above is inter- 
preted as zero if |S| > k, of course, and for S = 9, the m-partitions 
containing () are to be counted with multiplicity equal to the num- 
ber of times the empty set appears.) 

Our assertion will be proved by induction on ¢. Notice that it is 
trivially true for 2 = 0, where each A; will consist of m copies of 
the empty set. Also notice that the case = n will prove Theorem 
38.1, since the binomial coefficient in (38.1) is then 


0 _f1 if|S|=k, 
k—|S|/ | 0. otherwise. 
Remark. This somewhat technical statement is not really more 
general, but would follow easily from Theorem 38.1. If M parallel 
classes exist as in the statement of Theorem 38.1, then for any 
set L of & points of X, the intersections of the members of the 
parallel classes with L will provide m-partitions of L with the above 
property. 


Assume for some value of  < n that m-partitions Aj,,...,Aiy 
exist with the required property. We form a transportation network 
as follows. There is to be a source vertex o, another named A; 
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for each 1 = 1,2,...,M, another vertex named S for every subset 
SC {1,2,...,@}, and a sink vertex 7. There is to be a directed edge 
from o to each A; with capacity 1. There are to be directed edges 
from A; to the vertices corresponding to members of A; (use j edges 
to @, if 0 occurs j times in A;); these may have any integral capacity 
> 1. There is to be a directed edge from the vertex corresponding 
to a subset S to 7 of capacity 


n—-€-1 
k—|S|-—1) 


We exhibit a flow in this network: assign a flow value of 1 to the 
edges leaving a, a flow value of (k — |S|)/(n — £) to the ole from 
A; to each of its members S, and a flow value of (Ga \s|- = to the 
edge from S to tT. This is easily checked to be a flow: the sum of 
the values on edges leaving a vertex A; is 


Se. fa ial poe 


SEA; SEA; 


The sum of the values on the edges into a vertex S is 


5 gee el) tHE) of Bee 
n-€ n—-€\k-\S|J) \k-|S|-1/° 


SEA; 


Since all the edges leaving o are saturated, this is a maximum flow 
and has strength M. The edges into 7 are also saturated in this 
flow, and hence in any maximum flow. 
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By Theorem 7.2, this network admits an integral-valued maxi- 
mum flow f. All edges leaving o will be saturated, so it is clear 
that for each i, f assigns the value 1 to one of the edges leaving 
A; and 0 to all others. Say f assigns 1 to the edge from A; to its 
member S;. For each subset S$, the number of values of 7 such that 
S,= Sis (og 4)- 

(The desired integral flow f could also have been obtained with- 
out explicitly introducing capacities and using Theorem 7.2, but 
simply by applying Theorem 7.4 to the exhibited rational flow com- 
pleted to a circulation with the aid of an extra edge from T to o 
with flow value M.) 

Finally, we obtain a set of m-partitions A‘,...,A, of the set 
{1,2,...,€+1} by letting A’ be obtained from A; by replacing the 
distinguished member S; by S;U {€+ 1}, i = 1,...,M. Readers 
should check that each subset T of {1,2,...,2@+ 1} occurs exactly 


Gone 


times among Aj,...,A‘,. This completes the induction step. 
1 M 


Problem 38A. Let v and wu be integers with v > 2u and v even, 
and consider the complete graph K, as a subgraph of K,. Suppose 
the edges of Ky, are colored with v—1 colors so that distinct edges of 
the same color are disjoint. Show that this coloring can be extended 
to a coloring of E(x) with v — 1 colors so that edges of the same 
color are disjoint. (Such a coloring of E(K,) is equivalent to a 
1-factorization of K,, with the edges of any given color forming a 
1-factor.) 


Notes. 

Zsolt Baranyai (1948-1978) was a Hungarian mathematician who 
was also a professional recorder player. He toured Hungary with 
the Barkfark Consort giving concerts and died in a car accident on 
a country road after one of them. His mathematical work included 
many other wonderful results on ‘complete uniform hypergraphs’. 

One-factorizations of complete graphs are related to symmetric 
Latin squares—see Chapter 17. The result of Problem 38A is due 
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to Cruse (1974). Generalizations may be found in Baranyai and 
Brouwer (1977). 
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Appendix 1 


Hints and comments 
on problems 


Problem 1A. Show that the (3) pairs from {1,...,5} can be used 
to label the vertices in such a way that a simple rule determines 
when there is an edge. To find the full automorphism group, con- 
sider the subgroup that fixes a vertex and its three neighbors. This 
graph is known as the Petersen graph. It was first studied by the 
Danish mathematician J. P. C. Petersen (1839-1910). See also 
Chapter 21. 


Problem 1B. Let the vertex set V be the disjoint union of V; and 
V2, with no edges from Vj to V2. What is the maximum number of 
edges possible? 


Problem 1C. Use (1.1). (i) Consider a vertex of degree 1; use 
induction. (ii) A circuit has as many edges as vertices; if the graph 
is connected, every vertex has degree > 1. 


Problem 1D. Call the vertices aj, a2, a3 and by, bo, b3. First, omit 
a3 and show that there is only one way to draw the graph with the 
remaining six edges in the plane. A better way is to use Euler’s 
formula (see Chapter 33). 


Problem 1E. Each color occurs an even number of times in a cir- 
cuit. See J. A. Bondy, Induced subsets, J. Combinatorial Theory 
(B) 12 (1972), 201-202. The problem can also be formulated as 
follows. Given a square (0, 1)-matrix with distinct rows, it is pos- 
sible to delete a column so that the resulting matrix has distinct 
rows. This fact can also easily be proved by induction. 


Problem 1F. Consider the vertices at distance 2 from a fixed 
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vertex. See Fig. 1.4. 
Problem 1G. Use the pigeonhole principle. 
Problem 1H. Consider when a;; = aj, = 1. 


Problem 1I. Find an inductive procedure for constructing the 
required circuit. 


Problem 1J. For (i), establish a one-to-one correspondence be- 
tween the vertices adjacent to x and those adjacent to y. 


Problem 2B. Any ordering of 22335 works. 


Problem 2C. Let A; be the number of spanning arborescences 
with x; as root and denote by r; the number of edges out of 2;. 
The assertion can be proved by showing that the number of different 
Eulerian circuits in G is 


AMT (7; = 1)!. 


To do this, take 7 = 1 and consider some spanning arborescence 
with x, as root. Fix some edge out of x; as the first edge of an 
Eulerian circuit. Number the other edges out of x; arbitrarily. For 
i = 2,3,...,n, number the edges out of x; arbitrarily but such 
that the edge of the spanning arborescence has number 7;. This 
numbering leads in a natural way to the construction of an Eulerian 
circuit and this process can be reversed. See Chapter 9 of J. H. 
van Lint, Combinatorial Theory Seminar Eindhoven University of 
Technology, Lecture Notes in Mathematics 382, Springer-Verlag, 
1974. 


Problem 2D. One approach is as follows. Show that T,, has the 
property that for each of its edges a, a has ends in different com- 
ponents of G : {e € E(G) : c(e) < c(a)}. Then show that any 
spanning tree T with this property is a cheapest spanning tree. 


Problem 2E. There is only one way to obtain the difference n—1, 
etc. 


Problem 2F. Use Problem 1C. 


Problem 2G. Show that if G is a graph with this property, there 
is a graph G’ with the same property, the same number of edges, 
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and in which the vertex of degree m is connected to all other ver- 
tices with degree > 1. Then use induction (m — m+ 2). The 
construction can be done inductively by the same idea. 


Problem 3A. For part (ii), if the conclusion fails, then H can 
be written as the union of two edge-disjoint subgraphs A, B which 
intersect in two vertices s,t. Apply the induction hypothesis to 
graphs A’, B’ obtained by adding an edge joining s,t to A, B, re- 
spectively. 


Problem 3B. It is possible to use the same type of argument as 
given for Kg (below this problem). Show that there must be a 
vertex that is in two monochromatic triangles. Then, argue on the 
remaining six vertices. The argument will be long and tricky. The 
reader who has cheated and has read on, knows the Corollary to 
Theorem 3.2. 


Problem 3C. Call the terms, both even, on the right-hand side n1 
and ng. Suppose equality holds and consider a graph with nj+n2—1 
vertices. If there is no red K, and no blue K,, what is the red-degree 
of any vertex? How many red edges are there in this graph? 


Problem 3D. In Z;7 consider +2’, with i = 0,1,2,3. For the 
other problem use Z;3. Apply (3.4). 


Problem 3E. (a) Fix a vertex and consider all edges to, respec- 
tively from, this vertex. Pick the larger set and use induction. 

(b) Use the probabilistic method. A solution can be found in 
Theorem 1.1 of P. Erdés and J. Spencer (1974), where this problem 
is used to illustrate the method. 


Problem 3F. Take as vertices {1,2,...,n}. Color {7,7} with the 
color of |i — j|. To avoid a triple for two colors, if 1 is red, then 2 
must be blue and hence 4 must be red, etc. N(2) = 5. Separate two 
such 2-colored configurations by five numbers with a third color to 
estimate N(13). See I. Schur, Uber die Kongruenz 2” + y” = 2” 
(mod p), Jber. Deutsche Math. Ver. 25 (1916), 114-116. 


Problem 3G. Color with four ‘colors’ as follows: 


{t, J} +> (aij, aji) (<9). 
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Problem 3H. Use the argument that was used for Kg and two 
colors. 


Problem 3I. Let the color depend on v (mod 3). Show that the 
sum of two cubes in Fj, is not a cube. 


Problem 3J. Consider a maximal subgraph with the required 
property and then count the vertices not in that subgraph. 


Problem 3K. Consider vertices of a fixed color and delete edges 
in such a way that recoloring is possible. 


Problem 4A. (i) Color Kyj9 in such a way that red corresponds 
to an edge of G, blue to a nonedge. There are triangles with 3 red 
edges, 2 red and 1 blue, 2 blue and 1 red, and finally with 3 blue 
edges. Let a; (¢ = 1,2,3,4) be the numbers of these. Set up a 
system of equations and inequalities for these numbers, expressed 
in the degrees of the vertices of G. These should show that there 
are at least four triangles in G and equality can then be excluded, 
again by looking at the equations. 

(ii) A second solution is as follows. Show there is a triangle. 
Then consider a triangle, the ary 7-set, and edges in between. This 
gives a number of new triangles of two types; estimate it. 


Problem 4B. Use induction. First review the proof of Theorem 
4.1. 


Problem 4C. The number of triangles on an edge a with ends x 
and y is at least deg(x) + deg(y) — n. Sum this over all edges a. 


Problem 4D. Fix any vertex x and let D; denote the set of ver- 
tices at distance 7 from x in G. 

For g = 2t+1, |\V(G)| > 1+r+r(r—1)4+---+r(r—1)! since 
it is easy to see that |D,| = r(r —1)*"' for i=1,2,...,¢. 

For g = 2t, we still have |D;| = r(r —1)*"! for i=1,2,...,t—-1. 
We can also get a lower bound |D;| > 1D i4 since each vertex 
y € Dy; is adjacent to r — 1 vertices in D; while a vertex y € D; 


can of course be adjacent to at most r vertices in D;_. 
Problem 4E. The graph is bipartite. 


Problem 4F. By Theorem 4.3, there exists a Hamiltonian circuit 
HT. Assume that H does not contain an edge e with ends x and y. 
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Let x’, respectively y’, be the successors of x, respectively y, on H. 
Consider the path P from 2’ to y along H, then along {y,x}, and 
from x to y’ along H in the reverse direction. If the edge {2’, y’} is 
in G, we are done. If not, argue as in the proof of Theorem 4.3 to 
show that P can be completed to a circuit by removing one edge 
and adding two others. 

For a further generalization, see L. Lovasz (1979). 


Problem 4G. Show that the maximum value of S occurs when 
z; > 0 only on points of a clique. 


Problem 4H. Count pairs of edges with a common vertex in two 
ways. 


Problem 5A. (i) Given A C X, one side of a bipartition, count 
the number of edges that have one end in A and the other in T'(A). 
(ii) A trivalent graph with no perfect matching is the graph on 
four vertices and six edges with one vertex incident with three non- 
loops and loops incident with the other three vertices. Replace the 
loops by appropriate simple graphs. 
(iii) Same hint as for part (i). 


Problem 5B. Construct a sequence of sets A; of size m; using new 
elements only when it is necessary to keep condition H satisfied. A 
solution is given on p. 41 of Van Lint (1974). 


Problem 5C. Use induction on the number of nonzero entries of 
the matrix. The argument of Theorem 5.5 can be used similarly. 


Problem 5D. Theorem 5.5. 


Problem 5E. Express the number of SDR’s with 1, respectively 
2, representing A; in terms of S; with i <n. See Problem 14A. 


Problem 5F. Count the number of sets {x1, 72,...,v,} with x, € 
Aj, for all k and with x; = 2;. 


Problem 5G. (i) is easy but (ii) gets messier. Consult Example 
10.1 


Problem 6A. Define a poset using both the index and the size of 
the integers a;. 
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Problem 6B. (i) Consider A and A. (ii) Let x be in all the sets. 


Problem 6C. Show that large sets can be replaced by their com- 
plements, and apply Theorem 6.5. 


Problem 6D. (i) No two of the subsets are in the same chain. (ii) 
Use induction. 


Problem 6E. See C. Greene and D. Kleitman, Strong versions of 
Sperner’s Theorem, J. Combinatorial Theory (A) 20 (1976), 80-88. 


Problem 7A. This is a straightforward application of the algo- 
rithm. The minimum cut capacity is 20. 


Problem 7B. Given a flow f with nonzero strength, show that 
the edges e with f(e) # 0 contain the edges of a simple directed 
path from s to t. Subtract a scalar multiple of the corresponding 
elementary flow to obtain a flow f’ which vanishes on more edges 
than f and proceed by induction. 

Four vertices suffice for the required example. 


Problem 7C. Consider a maximum flow f. From Equation (7.1), 
edges from X; to Y; are saturated and edges from Y; to X; have 
zero flow value. 


Problem 7D. Given the bipartite graph G with vertices X UY , 
construct a network by adding two vertices s and t, edges directed 
from s to elements of X of capacity 1, and edges directed from 
elements of Y to t of capacity 1. Direct all original edges from X 
to Y; it is convenient to assign them a large capacity, say |X| + 1, 
so that no such edges are present in a minimum cut. Explain why 
there exists a complete matching from X to Y in G if and only if 
this network has maximum flow strength X. 


Problem 7E. For (iv), apply Theorem 7.2, with d = 2, to the 
incidence matrix of the graph. (This result is also a quick conse- 
quence of Theorem 1.2.) For (v), consider the bipartite incidence 
matrix M of a bipartite graph G, which is defined as follows. Say 
G has bipartition (X,Y). The rows of M are indexed by the ver- 
tices in X, and the columns of M are indexed by the vertices in Y. 
The entry in row x € X and column y € Y is the number of edges 
joining x and y (0 or 1 if G is a simple graph). 
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Problem 7F. Let 7 be a spanning tree in D. Show that for any 
assignment of real numbers to the |E(D)| — |V(D)|+ 1 edges of D 
not in T,, there is a unique way to extend this to a circulation f on 
D. It may be useful to remember that a tree with at least one edge 
has a monovalent vertex. 


Problem 8A. Assume that some subsequence of n consecutive 
elements occurs twice. Show that a satisfies an equation of degree 
less than n. 


Problem 8B. The vertices of the digraph are the ordered pairs 
from {0,1,2}; the edges are the 27 ordered triples. 


Problem 8C. The sequence must contain four 0’s and four 1’s. 
Three consecutive 0’s force 00010111 which does not work; the pair 
00 must occur twice and this then forces 00110011 which does not 
work either. 


Problem 8D. Since every vertex in G,, has indegree 2 and outde- 
gree 2, we do get a closed path from 00...0 to 00...0. The final 
edge must come from 10...0 (which we identify with the integer 
1 in binary). The fact that this edge is used implies that the edge 
from 1 = 10...0 to0...01 has already been used. So, we came into 
the vertex 1 twice, i.e.from 2 = 010...0 and from 3 = 110...0. 
For the same reason as above, both 2 and 3 were entered twice. 
Proceeding by induction, we see that all vertices were visited twice 
on the path through G,,, proving the assertion. 


Problem 9A. Apply the algorithm. 


Problem 9B. A trivial addressing of length n as in Theorem 9.2 
is easily found. What is the diameter of the graph? 


Problem 9C. Use Theorems 9.1 and 9.6. Somewhere in the calcu- 
lation 5) kcos(ka) will appear. This is the derivative of }> sin(kz) 
and that sum is determined by multiplying by sin($2). 


Problem 9D. See Theorem 9.7.4. 


Problem 10A. Use Theorem 10.1, where F; (i = 1, 2,3, 4) are the 
integers < 1000 divisible by 2,3,5,7, respectively. 
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Problem 10B. Use inclusion-exclusion and the fact that if f(i) = 
0, then (a — 7) divides f(z). 


Problem 10C. Use |x| = }¢,<,1 and Theorem 10.3. 


Problem 10D. Multiply }>a,n~* and 5>b,m~*. Determine the 
coefficient of k~*, and use Theorem 10.3. 


Problem 10E. Determine }/j,,, log fa(z); use Theorem 10.3 or 
10.4. 


Problem 10F. There are clearly 2n + 1 colorings. Use inclusion- 
exclusion where a set E; consists of colorings with 7 red and 7 — 1 
blue. To determine N;, use Example 10.6. 

For a direct solution, calculate 


3 s7(-1)! co ‘) g2n—2k. 7.2 


n=0 k=0 


To do this, use (10.6) and the fact that )>(2n + 1)?” = (;s)’. 
Problem 10G. Use Theorem 10.1. 


Problem 10H. Count permutations of 1,2,...,n that fix none of 
Tes 


Problem 11A. Consider an all-one matrix of size n by k. Adjoin 
a column of 0’s and then complete to a square matrix by adding 
rows of type (0,...,0,1). 


Problem 11B. (i) Theorem 5.3; (ii) Theorem 11.5; (iii) use a 
direct product of matrices Jz. 


Problem 11C. Determine A = (a;;) with a;; = |A;M B;|. To 
find the permanent of this matrix, compare with the probleme des 
ménages. 


Problem 11D. Use Theorem 11.7. 


Problem 11E. Calculate the corresponding permanent B,, by ex- 
panding by the first row and column. Show that B, = B,_; + 
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Bn-2 — 2. So {B, — 2|n > 3} is a Fibonacci sequence. Also see 
Problem 5E. 


Problem 12A. Use Theorem 12.1. 


Problem 12B. Suppose AA! is decomposable. Explain the block 
of 0 entries by considering some row of A, its nonzero entries, and 
the inner products with other rows. 


Problem 12C. Do four averaging operations on pairs of rows and 
similarly for columns. Note that the element as; does not change 
during these operations. Determine the permanent and minimize. 
This shows that the matrix was zJ from the start. 


Problem 13A. Use Theorem 13.1. 


Problem 13B. (i) Multiply two powers of (1 — x). (ii) Let k be 
in position a+ 1; what can precede this term? 


Problem 13C. (i) Choose A; and then choose Az as a subset 
of its complement; determine the sum of products of binomial co- 
efficients. (ii) Consider a 2 by n matrix for which the rows are 
the characteristic functions of A,, respectively Ay. Translate the 
condition on A; and A» to a condition on this matrix. 


Problem 13D. First, fix the union U of the sets in A. Apply 
Example 13.6 and then sum over all choices of U. The answer 
shows that we could have found a better solution. To find this 
better solution, again depict A by a (0,1)-matrix of size k by n. 
To this matrix adjoin a ‘special’ row, namely the characteristic 
function of the union of the sets of A. Calculate the number of 
these (0,1)-matrices that have a 1 in a specified position in their 
‘special’ row. 


Problem 13E. In each path, consider the last step of type (x, y) > 
(c+ 1,y+1). Use Theorem 13.1. For the last question, insert k 
balls into n+k possible positions; let x; be the length of the interval 
preceding the k-th ball. 


Problem 13F. (i) Prove by induction. If a permutation of 1 to 
n — 1 is written as a product of cycles, there are n ways to adjoin 
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the element n, one of which has one more cycle. (ii) In Theorem 
13.7, sum both sides over k; use (13.5). 


Problem 13G. Insert (13.13) into (13.11) and change the order 
of summation. 


Problem 13H. Use (10.6). 
Problem 131. Use the relation F(a) = G(a) of page 123. 


Problem 13J. Use the binomial formula with x = ¢, where ¢? = 
1. 


Problem 13K. Count partitions of an n-set into n — 2 parts di- 
rectly. 


Problem 14A. (i) The sequences end in 10 or in 1. Establish the 
Fibonacci recursion and solve by trying a, = t” as a solution (use 
linearity). The method of Example 14.3 also can be used. 

(ii) Express 6, in terms of the numbers c,, of such sequences end- 
ing in a1. Find a recurrence for c,. The constant c will be the 
largest root of a polynomial equation of degree 4; it is approxi- 
mately 1.22075. 


Problem 14B. Consider a trivalent planted plane tree 7. Make 
a new tree 7” as follows. Take as vertices the root of T and all its 
vertices of degree 3. Join two vertices in T” if there is a path from 
one to the other in T using exactly one edge with slope to the left 
(see Figure 14.3). Finally, add a new root at the bottom. Show 
how to reverse this mapping. Show that the number of vertices of 
T’ equals the number of vertices of degree 1 in T. See p. 25. of J. H. 
van Lint Combinatorial Theory Seminar, Eindhoven University of 
Technology, Lecture Notes in Mathematics 382, Springer—Verlag, 
1974. 


Problem 14C. Fix one edge; number others cyclicly: a,b,c,.... 
Give the chords the obvious names; be careful about the order in 
which these names are given. See also p. 25. of Van Lint (1974), 
ibid. 

Problem 14D. Use the method of Fig. 2.2. One fixed point pro- 


duces an arborescence; the remainder is a mapping with no fixed 
points. See (14.15). Take M\(0) = 1, A(0) = 0. 
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Problem 14E. Fix an edge and then consider the quadrilateral 
that it belongs to. The other three sides divide the (n+ 1)-gon into 
possibly empty polygons; see (14.10). Solve f = 2+? by applying 
Theorem 14.3. 

A combinatorial proof that the number of ways of dividing an 
(n + 1)-gon into quadrangles by nonintersecting diagonals is 0 if n 
is even, and Wot Grr) if n = 2k +1, can be given as follows. Since 
the case of even n is trivial, we assume n = 2k +1. There are k—1 
diagonals and k quadrangles in the dissection. Put a vertex in each 
quadrangle and on each edge; fix one edge-vertex as a root. Vertices 
of quadrangles are joined by an edge if the quadrangles share a 
diagonal; an edge-vertex is joined to the quadrangle-vertex of its 
quadrangle. Describe a walk around the tree, that was just defined, 
by a sequence of symbols x and y, where we use an x every time we 
reach a quadrangle vertex not visited before, and similarly a y for 
edge-vertices. We find a sequence of k symbols x and 2k+1 symbols 


y. We know, see Example 10.6, that there are ga (Ot *) circular 


sequences of this type. It remains to show a preone. corressondence 
between these circular sequences and the sequences describing the 
trees. Describe the x,y sequence as follows: an x corresponds to a 
step in the X-Y plane one unit to the right and one unit upwards; 
a y to a step one unit to the right and k/(2k + 1) steps downward. 
The walk starts in (0,0) and ends in (3k + 1,0), never meeting 
the X-axis in between. If we describe a circular sequence of the 
given type in the same way, it will have a unique minimum because 
(k,2k +1) =1. This will give us the unique starting point to make 
it correspond to one of the trees. 


Problem 14F. See Appendix 2 on formal operations. Take loga- 
rithms and use Theorem 10.3. 


Problem 14G. Show equivalence with Example 14.11. 


Problem 14H. Make a path as in Example 14.8 and find a lowest 
point. 


Problem 14I. (i) See (14.10). (ii) Put a vertex on the circle, say 
between 1 and 2n. This will be the root of a tree. Put a vertex in 
each region and join vertices in the obvious way. Show equivalence 
with one of the Catalan problems. See Van Lint (1974), p.26. 
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Problem 14J. A labeled regular graph of valency 2 is a union of 
(labeled) polygons. Apply Theorem 14.2. 


Problem 14K. Call the number of walks Ay, and define A(z) : 
1+ 0%, Ainz’”. If Bois the number of walks from (0,0) to (n,n 
that avoid points (i,i), then we know from (14.12) that B(z) : 
Soe Bonz?” = 1- V1 — 422. Find a relation between A(z) and 
B(z) by considering the first crossing of x = y for each walk. 

It is also possible to prove the result by finding a one to one 
mapping from the walks counted by Ay, to the walks counted in 
Example 14.8. This is difficult. For a solution see the solution to 
Problem 3096 by W. Nichols in American Mathematical Monthly 
94 (1987). 


Problem 14L. Show that a, = 3an_1 —a@n_2 and that F»,, satisfies 
the same recurrence. 


| = || 


Problem 14M. For g with k fixed points, count their number and 
the number of appropriate f and sum over k; for f with image of 
size i, split {1,2,...,r} into preimages, count the f and the number 
of appropriate g; sum over i. Use (13.9) and (13.11). This is the so- 
lution to Problem 3057 of the American Mathematical Monthly 94 
(1988), given by J. M. Freeman, S. C. Locke, and H. Niederhausen. 


Problem 14N. Introduce b;,, := the number of walks from (0,0) 
to (n,n) that meet the line x = y for the second time in (k, k). 


Problem 15A. Distinguish between x, = 1 and x, > 1, in which 
case take y; = x; — 1. Use induction. 


Problem 15B. Let F, J, and S denote the number of equilateral, 
isosceles, and scalene triangles, respectively. Calculate E, J + E, 
and express (3) in E, J, and S. 

(This proof is due to J. S. Frame, Amer. Math. Monthly 47, 


(1940), 664.) 
Problem 15C. Use partial fractions. Find c using the coefficient 
of (1 — x)~* and (10.6). 


Problem 15D. Count the compositions of n into k parts which 
have a 3 in position j. Do this for all 7 and sum; (a composition 
with m parts 3 is thus counted m times). 
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A second solution with a more general result is as follows. First 
show that in a list of all possible compositions there are exactly 
(n+1)-2”- integers. This can be done using the argument with 
the red balls. Then prove by induction on n that for 1 <m <n 
a list of the 2”~' compositions of n contains the integer m exactly 
(n —m+3)2"-"~ times. For this, use the formula 1+ S7721(k + 
3)2*-? = (n+ 1)- 2"? which is also shown by induction. 


Problem 15E. Leave out the first column. 


Problem 15F. For the unequal odd parts, use ‘hook’ shaped fig- 
ures to form a Ferrers diagram and read this in the usual way. 


Problem 15G. Show a correspondence with Example 14.8. 
Problem 15H. See Chapter 13. 

Problem 15I. Generalize Theorem 15.4. 

Problem 15J. Apply Theorem 15.10 to an n x n square. 


Problem 16A. Use induction; compare two initial segments of r 
and s. 


Problem 16B. Determine the first column of a matrix with row- 
sum r and column-sum r*. Use induction. 


Problem 16C. Use Problem 16A and switch edges. Another pos- 
sibility is to describe the graph by its adjacency matrix and use the 
same idea as in the proof of Theorem 16.2. 


Problem 16D. Estimate the number of solutions of s;+---+s, = 
sn” by just using the fact that each s; is at most n. 


Problem 16E. For the ‘if’ part, first note that there exists a tour- 
nament with degree sequence (n—1,...,2,1,0). Then use Problem 
16A. Show that if in a tournament the outdegree of x is greater 
than the outdegree of y, then there is a tournament on the same 
vertex set and with the same outdegrees except that outdegree of x 
is one less than it was originally, and the outdegree of y is 1 greater. 
One can do this by changing the directions on at most two edges. 


Problem 16F. For the generalization, one considers the hyper- 
graph given by choosing the first m of the k-subsets in lexicograph- 
ical order. For the ‘if’ parts, again use Problem 16A. 
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Problem 16G. A(5,3) = A(5,2). See (16.2). Counting directly is 
possible as follows: (i) Fix the first row as (1,1,0,0,0); this means 
that the final result must be multiplied by 10. (ii) Put a 1 in 
position (2,1); this means that we should also multiply by 4. (iii) 
Split into two cases: a 1 in position (2,2), etc. or a 1 in position 
(2,3), in which case a factor 3 is introduced, etc. 

Another solution is obtained by observing that A*(5,2) has car- 
dinality $5!4! = 1440, and that there are (ayy? - 3! = 600 decom- 
posable matrices in A(5, 2). 


Problem 16H. In case 1, there are n—1 possible choices. In case 
2, remove row 1 and column 1, substitute an element of A(n— 1, 2) 
and in two ways then produce an element of A(n, 2). 


Problem 161. Straightforward substitution. 


Problem 17A. Index the first two rows by x and y, the first two 
columns by wu and v, all under the assumption that we have the 
multiplication table of a group of order 5. Show that (v~!u)? = 1. 


Problem 17B. This is already difficult, despite the small size of 
the square! Take a = 1. Note that the 1 in the upper left-hand 


corner is in subsquares of type e . for 7 = 2,3,4,5. Consider 


any Latin square with this property. If the (2,3) entry is not 4, 
then permuting rows 4 and 5, columns 4 and 5, and symbols 4 and 
5, changes the square into an equivalent one with a 4 in position 
(2,3). At this point, there is only one way to complete the square. 
So, if a Latin square has any cell with a property like the one we 
mentioned for the upper left hand 1, then it is equivalent to this 
one. 

Using the same kind of argument one can show that there are 
only a few possibly inequivalent squares of order 5. For some, one 
then has to establish the equivalence, a tedious task. 


Problem 17C. At this point, trial and error is prescribed. How- 
ever, see Chapter 22. 


Problem 17D. Sufficiency is shown by an example (you can, but 
do not have to, use Theorem 17.1). For necessity, observe that just 
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appending one more column to a Latin square of order m requires 
m new symbols. 


Problem 17E. Apply the algorithm. 


Problem 17F. For part (i), describe an algorithm that recovers 
the original Latin square of order n from the constructed Latin 
square of order n+ 1 with constant back diagonal. For (ii), explain 
why the number of Latin squares of order n with n’s on the back 
diagonal is equal to N(n)/n!. 


Problem 17G. (a) For n = 6, an example is 6,1, 5, 2, 4, 3. 
(b) Add the differences. 
(c) Assume two pairs are equal and solve. 


Problem 17H. Start by replacing the diagonal entries of the 
smaller square by (n + 1)’s. 


Problem 17I. See Example 10.6. 


Problem 17J. (i) Use inclusion exclusion. If & integers are in the 
wrong place, write k = 22 + 7 where there are 7 pairs in the wrong 
place, etc. 

(ii) Use J — I of size n. 


Problem 18A. To show this directly, argue as in Theorem 18.1 
concerning the first three rows. Show that no further row can start 
with three +1’s. Hence, rows 5,6, and 7 all start with + + — 
(why?). These can then be completed in a unique way, etc. For 
another solution, see Problem 19D. 


Problem 18B. Use the method of Theorem 18.1. Note that a 
permutation of rows and columns, that preserves the 0-diagonal, 
does not help us to make a matrix symmetric or antisymmetric. 
So, only multiplications by —1 will do the job. Normalize the first 
row and make the first column (except for the Oth entry) all 1 or 
all —1. It is now sufficient to show that the (2,3) entry and the 
(3,2) entry are the same, respectively opposites, whenever n = 2 
(mod 4), respectively n = 0 (mod 4). 


Problem 18C. A trivial exercise! 


Problem 18D. We have 28 = 1+9+9+49 and 28 = 1+14+1+425. 
Use the first of these. Note that each W; has 1 on the diagonal 
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and if it has w;; = 2, then w7_;,; is also 2; similarly for —2. The 
observation about occurrences of the U’ in the matrices W; then 
completely forces the choice of the four matrices W;. They prove 
to work. 


Problem 18E. Show by induction that Mi)... M0") = Hym-in1® 
I,:-1. For the second part, count the number of non-zero elements 
of M\. See the reference E. C. Posner (1968). 


Problem 18F. For both questions use induction. 


Problem 18G. For (2), show that the number of rows with first 
coordinate 0 (or 1) is at most n. 


Problem 18H. Straightforward; see Theorem 18.4. 
Problem 181. Review the properties of Q. 


Problem 19A. For the isomorphism in part (i): Label the vertices 
of Kg with 0000, 1000, 0100, 0010, 0001, 1111 in F} and label an 
edge with the sum in F} of the labels on its ends. This gives a one- 
to-one correspondence between the edges of Kg and the 15 nonzero 
vectors in F}. 

For part (ii), consider configurations consisting of three edges 
(there are four types). The construction is due to E. 8. Kramer 
(1990), Discrete Math. 81, 223-224. 


Problem 19B. Calculate b;, i = 3,2,1,0; b; ¢ Z. 
Problem 19C. This is Theorem 19.4. 


Problem 19D. (i) Let N have column-sums c;. Then Soc; = 
66. The sum of all inner products of rows is }> (%). Use the 
Cauchy-Schwarz inequality. Conclude that all column-sums are 6 
and all inner products are 3. (ii) To show uniqueness, consider the 
complement: a 2-(11,5,2) design. Without loss of generality, the 
first row is (11111000000); the remaining ten rows start with the 
(3) pairs (110...), (1010...), etc. Then show that the rows 2 to 
5 can be completed in one way. This leaves a 6 by 6 square to be 
completed. At first sight, there seem to be two ways to do this. 
Find a permutation that maps one into the other. 
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Problem 19E. Denote by a; the number of blocks that meet B in 
i points. Calculate }> a;, 5>ia;, and S> ) a;. Derive an inequality 
for aj using the form S*(i — c)?a;. Compare with Theorem 19.9. 


Problem 19F. The mapping x +> 2z has six orbits, two of which 
are too long to use (e.g. (1,2,4,8,16,11)). See Example 28.2. 


Problem 19G. Use the corresponding orthogonal array. Use the 
common {0,1} to {1,—1} mapping. 


Problem 19H. For (1) use (19.2) or (19.4). For (3) count pairs 
p,q € B that are contained in blocks that meet p. Thus find the 
parameters of D?. For (4), first suppose that D? is a single block. 
Consider Example 19.3. In the other case Fisher’s inequality yields 
k > (A+1)(A+ 2). Now, express bp as a fraction in k and 4 and 
deduce that k must divide 2(\ + 1)(A + 2). 


Problem 191. (i) Fix a point x on O. Count pairs (y, L), y € O, 
[a line through z and y. (ii) From (i) we see that if |O|] = n+ 2, 
then every line that meets O, meets it twice. Take a point z ¢ O 
and count the lines through z that meet O. (iii) Take any four 
points, no three on a line. These determine six lines, that together 
contain 19 points of the plane. 


Problem 19J. For p; to ps take (1,0,0), (0,1,0), (0,0,1), (1,1,1) 
and (1,t,¢?) and show that there are two ways to group these to 
produce pg = (1, 2,27), where x = t/(1+1%). 


Problem 19K. Let B,; ~ By. How many blocks meet B,? Take 
x € By, y € Bo; how many blocks meet both B; and Bj? There are 
n+ 1 equivalence classes, each with n lines (blocks). To each class 
append a point that is to be added as an extra point to each line 
of that class, etc. 


Problem 19L. Use Theorem 19.11. Consider the equation mod- 
ulo 3. 


Problem 19M. For v even, use quadratic forms, as in the proof 
of Theorem 19.11. See J. H. van Lint and J. J. Seidel (1966), 
Equilateral point sets in elliptic geometry, Proc. Kon. Nederl. Akad. 
Wetensch. 69, 335-348. 
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Problem 19N. (a) The method of Example 19.14 also works if 
\V| = 1; (85 = 7113 — 1) +1). (b) 15 = 7(3-1) 41. 

Problem 190. Consider the n x 6 incidence matrix N, where 
b=|Al|. Check that in case (i), N'N is nonsingular modulo 2. For 
part (ii), let Ny be obtained by adding column of all 1’s to N, and 
show that Ni N, has rank at least b modulo 2. 


Problem 19P. The last of the displayed sums should evaluate to 
zero. This implies each py; is 1 or 2. 


Problem 19Q. Use induction; start the induction with t = 2. 
Count, in two ways, ordered triples (x, A,B) where A and B are 
distinct blocks, both of which contain the point x. 


Problem 19R. Represent lines as in Example 19.6 and use the 
fact that this is a cyclic representation. Distinguish different cases 
by the number of ‘first’ elements used in the SDR. 


Problem 198. It is possible to give a construction using induction 
on k, but it is quicker to give a direct construction using F¥ as the 
point set. Review Example 19.1. 


Problem 19T. Use the result of Problem 19H. The extensions 
will appear in Chapter 20. 


Problem 19U. Apply Theorem 5.5. 
Problem 20A. Use (20.5). 


Problem 20B. Apply the pigeonhole principle to the last two bits 
of the words. A binary code with three words is easily handled! 


Problem 20C. For each of the three ways in which the conditions 
for a codeword can be violated, change a suitable coordinate to 
remedy the situation. 


Problem 20D. (i) The rows of the generator matrix correspond 
to ta;, 7 = 1,...,n, where the a; are the rows of a Hadamard 
matrix. Let u € EF’, where n = 27. By u* we denote the +1 
representation. Calculate 5>(u*,a;)? to find the upper bound. 

(ii) For equality, first find the weight of z. Then use the fact that 
the codewords correspond to linear functions and adding a linear 


function to x yields an equivalent quadratic form. 
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Problem 20E. (i) See Problem 20A. (ii) Count words of weight 5 
‘covered’ by codewords of weight 7; the other words of weight 5 are 
covered by codewords of weight 8. Using this idea, set up a system 
of linear equations for the A;. 


Problem 20F. A codeword at distance < e from a word of weight 
e+1 must have weight 2e + 1. 


Problem 20G. The codewords in the dual of the binary Hamming 
code of length 2” — 1 may be identified with linear functionals on 
FS (cf. the geometric description of Reed-Muller codes in Chapter 
18); each nonzero codeword has weight 2”~1. 


Problem 20H. Use (19.6) and the fact that G2, is linear and has 
minimum weight 8; note that there are (3) pairs of lines. If a = 1, 
consider the 7-point configuration B* in the plane. Any line that 
meets B* must meet it in 1 point or in 3 points. How many meet 


B* in 3 points? 


Problem 20I. (i) Find the standard generator matrix for C. (ii) 
The conditions (1) and (2) clearly define a linear code. In (1) 
we have two choices for parity; in (2) we have 4° choices for the 
codeword; in five of the columns of A we then have two possibilities 
(not in the last one!). For the choice ‘even’, it is clear that a 
nonzero codeword in C’ forces weight > 8; O has to be treated 
separately. For the choice ‘odd’, it is clear that we obtain a weight 
of at least 6; condition (1) ensures that equality cannot hold. For 
more on this wonderful description of the Golay code and many 
consequences see: J. H. Conway, The Golay codes and the Mathieu 
groups, Chapter 11 in J. H. Conway and N. J. A. Sloane, Sphere 
Packings, Lattices and Groups, Springer-Verlag, 1988. 


Problem 20J. Self-duality is done as in Example 20.6. All weights 
in Sym, are divisible by 3. Consider linear combinations of two 
rows of the generator matrix and then conclude what happens if 
we combine with a third row. Use (20.5). 


Problem 20K. Prove the generalization by induction on r. For 
each r, find such a matrix none of whose columns is a constant 
vector. For the induction step, concatenate three copies of the 
matrix with r rows, to which have been appended a row of all 0’s, 
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all 1’s, and all 2’s, respectively, and then add three appropriate 
columns. 


Problem 20L. The hint in the chapter should be sufficient. 


Problem 21A. The argument is the same as for (21.4), now with 
inequalities. 


Problem 21B. Use the integrality condition. Also see the refer- 
ence A. J. Hoffman and R. R. Singleton (1960) from Chapter 4. 


Problem 21C. Use (19.2); see Problem 20F. If two blocks are 
disjoint, count the number of blocks that meet both of them, each 
in two points. How many meet only one of them? Similar counting 
arguments (distinguish a few cases!) are used to show that ps = 6. 


Problem 21D. Suppose such a graph exists. Pick a vertex x and 
consider the sets I(x) and A(x). We consider these as the points 
and blocks of a design. Show that this is a 2-(9,4,3) design. Next, 
fix a block of this design and let a; be the number of blocks that 
meet this block in i points. Show that ap < 1 (see Problem 19E) 
and show that the fact that A(a) has degree 5 implies that ao 
should be at least 5. 


Problem 21E. The first two questions are straightforward. For 
the third we refer to textbooks on matrix theory (see ‘interlacing’) 
or to Chapter 31. The theorem we are alluding to states that the 
eigenvalues of a principal submatrix B of a symmetric matrix A are 
bounded by the largest and the smallest eigenvalue of the matrix A. 
To prove this, just consider any vector x and calculate x! Ax/x!x. 
Specialize by letting x be an eigenvector of B with 0’s appended. 


Problem 21F. Review the proof of Theorem 21.5. Show that 
I(x) = Ky-1n-1. Define a grand clique to be a clique of size n and 
show that any edge is in exactly one grand clique. Show that there 
are exactly 2n grand cliques, which fall into two classes of n so that 
grand cliques in the same class are disjoint. See S. S. Shrikhande, 
The uniqueness of the [2 association scheme, Ann. Math. Stat. 30 
(1959), 781-798. 


Problem 21G. Count flags and other configurations, e.g. two in- 
tersecting lines and a point on one of them. 


562 A Course in Combinatorics 


Problem 21H. See Problem 21E for v; k is obvious. Take two 
points on line L. Points joined to both of them are on L or on one 
of the other R — 1 lines through one of them. A similar argument 
yields y. For r and s see (21.6). 


Problem 211. On [(«) use the fact that w~ = 1. Then consider 
{x}UT (2). 


Problem 21J. Eq. (21.8) leads to an equation like (21.6) with t 
instead of k. Although A is not symmetric the eigenvectors for 
eigenvalue # k are orthogonal to j. 


Problem 21K. Show that if (A — )? + 4(t — 2) is not a square, 
then the eigenvalues have multiplicities 1,f, and g, where f = g = 
$(v—1). Then show that A = A’ = J—I and AA' =IJ+(I+1)I, 
where v = 41+ 3. Define Q:= A— Al. 


Problem 21L. Let d:= \/(A— “)? + 4(t — uw). Apply (21.8) to j 
and then show that d divides (k — 1)?. Using 4k — 7 = d?, prove 
that d must divide 9. 

For v = 18, use the matrix 


( cl ae 


(e-1)I4+J el 


and replace c by the matrix C of Example 21.11 and d by C? to 
obtain a matrix A of size 18 x 18. 


Problem 21M. (i) Consider [(2) NT(y) where (xy) is an edge. 
(ii) Prove that G is also neighborhood regular. 


Problem 21N. (i) ['(a1) = CUB is regular with degree a, so 
ICB| = alC| — |CC, etc. 

(ii) Eliminate |BC]| and |BD| from the relations in (i). 

(iii) Interchange x; and x». Use d, F do. 

(iv) Express d; in k;, a,@, and the number of vertices n of G. 
Then prove that ky + ky = 2n — 3a@—a—5. 


Problem 210. Part (i) is straightforward. For example, A’ = 
(n+ 3)/2. (ii) If four triples of a clique C contain a point x, then 
all triples of the clique must contain x. (iii) If n > 15, the cliques 
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of size (n — 1)/2 in the graph of a triple system are in one-to-one 
correspondence with the points of the triple system. 


Problem 21P. Equality would imply the existence of a srg(a? + 
a+1,a+1,1,1). Use Theorem 21.1. Only a = 1 is possible and a 
Ky: obviously has no circuit on four vertices. 


Problem 21Q. The graph G representing the friendship relation 
is either a union of triangles with one vertex common to all of them 
or G is regular. Use Theorem 21.1. 


Problem 22A. Show, by computation, that the complement of 
the graph of an (n,n)-net has the parameters of the graph of an 
(n, 1)-net. Then show that it 7s the graph of an (n, 1)-net, i.e. there 
is a partition of the points into n sets of size n, no two points of 
which are collinear. 

(More generally, an (n,1r)-net may be completed to an (n,n +1)- 
net if and only if the complement of the graph of the (n,7r)-net is 
the graph of an (n,n + 1 —r)-net.) 


Problem 22B. Use the constructions from fields described before 
the problem. Take S(x,y) := (a+ y)/2 for q odd, for example. 

For even q, we can use A and S to schedule mixed-doubles tennis 
matches. Suppose we have q couples—Mrs. 7 and Mr. 2 for 7 € 
F,- Let Mrs. 2 and Mrs. 7 play against each other with respective 
partners Mr. A(i,7) and Mr. A(j,z) during round $(i,7). There 
will be q — 1 rounds, labeled by the off-diagonal symbols in S, 
during which everyone is playing in exactly one match. It will be 
found that no one meets her or his spouse as partner or opponent 
the entire tournament, but meets everyone else exactly once as an 
opponent, and has everyone of the opposite sex exactly once as a 
partner. For gq odd, we can have q rounds in which one couple sits 
out each round. 


Problem 22C. Show that m,m-+1,t and u each have the property 
that for each prime p < 2, they are either prime to p or divisible 
by p”. 


Problem 22D. Review the connection between sets of pairwise 
orthogonal Latin squares and transversal designs. 
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Problem 22E. Use Theorem 22.6. 


Problem 22F. Columns of A consist of integers between 1 and k. 
For every column of A and of S$ replace an integer 7 in the column 
of A by the 7-th element in the column of S. By adding suitable 
columns and one row, complete to an OA(v,c+4 1). 


Problem 22G. The proof is similar to the proof of Theorem 22.6. 
One now also needs a (V,K,D), a TD(v,k). 


Problem 22H. Use Problem 22F. 
Problem 221. Use Problem 22G. 
Problem 22J. Use Problem 22G. 


Problem 23A. Note that the closure of a set S of edges consists 
of all edges that have both ends in the same connected component 
of the spanning subgraph of G with edge set S. 


Problem 23B. The lines of AG,.(2) have size 2. 


Problem 23C. Dispose first of the cases when the union of two 
lines is the whole point set. Then show how to set up a one-to-one 
correspondence between the points of any two lines. 


Problem 23D. The hardest part is showing that F = {Fi UF): 
F, € Fy, Fy € Fo}. Prove that Fy U Fy = FU Fy using the fact that 
the rank of FU F> is the sum of the ranks of F\ and F». 

See H. Crapo and G.-C. Rota (1970) for a complete discussion 
of connectedness of combinatorial geometries and irreducibility of 
geometric lattices. 


Problem 23E. Suppose AN F = @ and rank(A) + rank(F’) = 
rank(A U F’). Show that these equations remain valid if A is re- 
placed by A’ := AU {x} where z is any point not in AU F. 


Problem 23F. Use (23.4). 


Problem 23G. We may work in PG2(F) and assume without loss 
of generality that L, = [1,0,0] and Ly = [0,1,0]. Then (explain 
why) a; = (0,a1,1), b: = (0, 01,1), and c; = (0,y1,1), while ag = 
(a2, 0, ie by = (82, 0, 1), and (= (Ya, 0, 1). 
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Problem 23H. See Chapter 13. 
Problem 23I. Just use the definition of X. 
Problem 24A. See Chapter 13. 


Problem 24B. One answer is e; = i(m —k +7). Consider k by 
n +m matrices of rank k that are in echelon form. How many of 
these have 7 leading 1’s in the first n columns? 


Problem 24C. Check that x and y are contained in a coset of a 
subspace U if and only if x — y is contained in U. 


Problem 24D. Count all n x n matrices. 
Problem 24E. See Theorem 6.4. 


Problem 25A. For a subspace U of the r-dimensional space W, 
let f(U) be the number of k-subspaces which intersect W in U. 
Use (25.5). 


Problem 25B. For a subspace U, let f(U) denote the number of 
nonsingular mappings whose set of fixed vectors is (exactly) U, and 
h(U) the number of nonsingular linear mappings that fix (at least) 
the vectors in U. It is easy to give a formula for h(U) in terms of 
n,q and the dimension of U. We require a formula for f({0}). 


Problem 25C. Use (25.2) to calculate (0,27) for x of rank 1 or 
2; 


Problem 25D. There are only n — 1 partitions x # Oz, (each 
consisting of one block of size 2 and n — 2 singletons) that satisfy 
rva=Orz. 

Problem 25E. For « < z < y a chain of length k from x to z 
extends to a chain of length k + 1 from x to y. 


Problem 25F. See Theorem 25.1 (iii). Count injective mappings 
from V to S and then vary ||. 


Problem 26A. The number of lines on a point x not in A which 
meet A is a constant, i.e. independent of x. 


Problem 26B. Any line disjoint from the arc can be used to ob- 
tain a partition of the blocks into parallel classes, one parallel class 
for each point of the line. 
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Problem 26C. The point of intersection of the line though (1, 0, 0) 
and (0,1,0) and the line though (0,0, 1) and (1,1,1), for example, 
is (1, 1,0). 


Problem 26D. Let ju; be the cardinality of the intersection of the 
i-th line with S. The inequality is easy if some py; > /n + 1, so 
assume otherwise and consider )7,(ju; — 1)(u; — /n — 1). 


Problem 26E. Whether the characteristic is even or odd, f is 
degenerate if and only if x(C +C') = 0 for some x with f(x) =0. 
To find such an x when f is degenerate, take the last row of a 
nonsingular matrix A such that AC/A' has a last row and column 
of all zeros, which exists because x,, say, does not occur in some 
projectively equivalent form. Also see J. W. P. Hirschfeld (1979). 


Problem 26F. By Theorem 26.6 with gq = 2 and n = 2m —1, 
nondegenerate quadratic forms in 2m variables have either 2?”~! — 
2-1 or 2?m-1 4 9M! Zeros, where we are counting the zero vector 
here. So the codewords (vectors of length 2””) corresponding to such 
forms have weight 2?"-!+2™-! or 2?7”-1_9m-1. Since, as functions 
over Fy, x? = 2;, we may think of the polynomials f(x) + a(x) as 
quadratic forms, or their complements when there is a constant 
term 1. Show that all these forms are nondegenerate. 

Alternatively, show that there is an ‘invertible affine substitution’ 
(where e.g. x; may be replaced by x; +1) that takes f(x) +a(x) as 
a function on F}?”, to f(x) or f(x) +1. 


Problem 26G. It is relatively straightforward to show that Q’ is 
a nondegenerate quadric in W. There is nothing more to do if n is 
odd. Theorem 26.5 can be used to complete the proof for n even. 
It is easy to see that if Q’ is hyperbolic, then so is Q. Next use 
equation (26.4) to show that every point p on a hyperbolic quadric 
Q in PG,,_1(q) is contained in a flat F’ C Q of projective dimension 
n/2—1. Check that such a flat is contained in T, so that the flat 
F OW CQ has dimension one less than that of F. 


Problem 26H. If point x is not on line @, consider the plane 
determined by x and @. 


Problem 261. The points of a triangle are on a line. For (iii) see 
Chapter 21. 
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Problem 26J. See the argument for arcs in projective planes. 


Problem 26K. Consider three points. Let x; be the number of 
blocks of the complementary design that contain 2 of these points. 
Use the usual counting argument to show that xo + x3 = A. 


Problem 27A. It is not difficult to see that the even numbered 
items are equivalent to each other, and similarly for the odd num- 
bered items. To show (3) and (4) are equivalent, for example, 
introduce an incidence matrix N and show that one is equivalent 
to the matrix equation NN! = (k — \)I + AJ, and the other to 
N'N=(k-A)I4+-AXJ. 


Problem 27B. View (Z2)* as a vector space over the field Fy. We 
may assume the difference set contains the zero vector. It will span 
(Zy)* and so contains a basis. 


Problem 27C. Count separately the number of times a nonzero 
element of G x H can be written as a difference of two elements of 
A x (H \ B), a difference of two elements of (G \ A) x B, and as a 
difference of one element of each of these sets. 


Problem 27D. Multiply the sum of the elements, that we want 
to show is 0, by any square. 


Problem 27E. We already said that this problem is similar to the 
illustrated calculation of Ao. 


Problem 27F. Use Theorem 27.5. (A table of cyclic difference 
sets may be found in L. D. Baumert, Cyclic Difference Sets, Lecture 
Notes in Math. 182, Springer-Verlag, 1971.) 


Problem 27G. A zero w of the polynomial y?+3y+2 (coefficients 
in F,) is a primitive element of Fys . 


Problem 27H. The cardinality of the intersection of a translate 
D+4q of D with the set —D is the number of times g arises as a 
sum from D. 


Problem 271. For distinct i and j, the differences x — y, x € 
U;,y € U;, comprise all vectors in V, each exactly once. 
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Problem 28A. The Frobenius automorphism of Femti) IS an 
automorphism of the symmetric design of 1-dimensional and n- 
dimensional subspaces of Fs(n+1) Over Eye. 


Problem 28B. We may take S(x) as in Lemma 28.4 with D = 
{1,2,4} but where a is not a multiplier. 


Problem 28C. If n = 0 (mod 10), say, then for any x € D, 
{z,2x, 42,57} C D and the difference x is seen to occur twice 
unless 3x = 0. If 3x = O for all x € D, then 3x = 0 for all x in 
the group, and then n? + n+ 1 must be a power of 3. Show that 
n? +n+1 is never divisible by 9. 


Problem 28D. The numbers of normalized difference sets with 
these parameters are, respectively: 2, 2, 4, 2, 0, 4. 


Problem 28E. There are two normalized (15, 7,3) difference sets; 
they are equivalent since one consists of the negatives of the ele- 
ments of the other. There are no normalized difference sets for any 
of the other parameters. To prove this in each case, first find a 
multiplier by Theorem 28.7. 

For e.g. (v, k, A) = (25,9,3), Example 28.3 shows 2 is a multiplier 
of a hypothetical normalized difference set D. If the group is cyclic, 
D is the union of cycles of «+> 2” on Zo5 which are 


{0}, {5, 10, 20, 15} and Zo; \ {0,5, 10, 15, 20}: 


but no such union has nine elements. If the group is elementary 
abelian (exponent 5), then D must consist of 0 and two cycles of 
the form {x, 2x, 4x, 8x = 3x}; but the difference x occurs five times 
already among 0 and the elements of such a cycle. 


Problem 28F. Assume D(x~!) = D(x) and use Lemma 28.2 with 
p= 2. 


Problem 28G. Recall the construction of a symmetric design 
with those parameters from a Latin square of order 6 in Problem 
19G. 


Problem 28H. Assume D(x’) = D(x). Note that g? = —1 
(mod |H|). Show D(x*) = 0 (mod q, H) and conclude that one 
of the coefficients of D(x) is q+ 1 and all others are 1. 
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Problem 281. The first part is similar to Problem 28H. Then 
show that if, in PG3(q), a line meets a set S of size g+1 ink >3 
points, then one of the g+1 planes on that line contains more than 
q+1 points of S. 


Problem 29A. Review the proof of Theorem 20.6. 


Problem 29B. Review the latter part of the proof of Proposition 
29.5. 

Problem 29C. The answer is (v,k, A) = (4n — 1,2n — 1,n — 1) 
or (4n — 1, 2n, 7), i.e. Hadamard or the complement of Hadamard. 
Review the proof of Theorem 29.7. 


Problem 30A. A, A» = 8A, + 8A» +9A3 and A? = 90 Ao + 83A, + 
56.A2 + 36A3. 


Problem 30B. By (30.3), A;A; = Sot» Pi(a) P;(a) Eq. Then by 


(30.4), 
k 


k 
N A,A; = S° P,(a)P;(a) YS" Qa(8) As, 


a=0 B=0 


so evidently Pi; a ee P,(a)P;(a)Qa(@). The second eigenma- 
trix Q can be calculated by inverting P (also see Theorem 30.2). 


Problem 30C. For the Latin square graphs, 
1 (n—1)r (n—1)(n—r+1) 
P= 1 n-r —nt+r—-1 : 
1 -r r—1 


Problem 30D. To evaluate the entries P;(0) of the top row of 
P, take the inner product (as introduced in the proof of Theorem 
30.2) of both sides of (30.3) with Ep. 


Problem 30E. This is the dual to Problem 30B. 
Problem 30F. Try |A| = |B| = 3. 


Problem 30G. Note that for any automorphism a, |ANo(B)| < 1. 
Count the number of automorphisms of G. 


Problem 30H. Given a metric scheme arising from a distance 
regular graph G, first note that Aj is a linear combination of 
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Ao, Aj,..., A; where the coefficient of A; is positive. Given a P- 
polynomial scheme, define a graph G by saying that x and y are 
adjacent in G if and only if they are lst associates in the scheme, 
and show that two points are 7-th associates if and only if they are 
at distance 2 in G. 


Problem 301. Use the counting argument of Problem 20K for 
words of weight 8 first; then find the other words satisfying the 
condition. Use the subcodes with a fixed 7 and the subcode of 
Problem 20K. 


Problem 30J. If such a code did exist, it would be equivalent to 
a code with a generator matrix with (1111100...0) as first row and 
below that (A B), where B generates a code of length 8, dimension 
5, and minimum distance at least 3. Now use the fact that the 
[7,4,3] Hamming code is unique (up to equivalence). 


Problem 30K. (i) Distance depends on the size of the intersec- 
tion only; just count; (ii) The graph has degree 4 and two triples 
corresponding to the Fano plane meet in one point. 


Problem 30L. For J(k,v) show that a necessary condition is that 
1+ k(v —k) divide (7). 


Problem 31A. Consider the intersection of the eigenspaces of A 
and A» corresponding to the eigenvalue 1. 


Problem 31B. Since the adjacency matrix of the complement of 
a graph G with adjacency matrix A is J — I — A, the eigenvalues 
of G are easily determined from those of G. The first examples in 
Chapter 21 are instances in which equality can occur. 


Problem 31C. (i) The Petersen graph has 3-claws, i.e. induced 
subgraphs on four vertices consisting of three edges incident with 
a common vertex (and no other edges). Explain why a line graph 
cannot contain a 3-claw. (ii) The matrix N'N is positive semidef- 
inite and is equal to 2J + A, where A is the adjacency matrix. 


Problem 31D. From the equation following (31.1) with x the 
vector of all 1’s, it follows that the largest eigenvalue of a graph is 
bounded below by the minimum degree of the graph. By Lemma 
31.5, the largest eigenvalue of a graph G is bounded below by the 
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minimum degree of any induced subgraph H of G. If y(H) = x(G) 
and the deletion of any vertex x of H decreases the chromatic 
number, then deg(x) > y(H) — 1. 


Problem 31E. If there are no directed paths from x to y in a 
digraph G with adjacency matrix A, let S be the set of vertices 
z such that there does exist a directed path in G from 2x to z. 
Then A(z, y) = 0 whenever a € S and b € V(G)\S, so A is not 


irreducible. The converse is even easier. 


Problem 31F. (i) Show by straightforward calculation that u;A = 
Ajuj, where uj; = (1,47,w”,... ,w"-)J). For (ii), define, for each 
character x, a vector uy whose coordinates are indexed by G and 
where uy(g) = x(g), and show that uyA = A,uy. 


Problem 31G. To prove the lemma, show that if x is a row-vector 
so that xS = 0, then xa = 0. Apply the lemma to S = A+ TJ, 
where A is the adjacency matrix of the graph. The lemma may 
be found in B. Bagchi and N. $8. Narasimha Sastry, Even order 
inversive planes, generalized quadrangles and codes, Geometriae 
Dedicata 22 (1987), 137-147. The graph problem (phrased in terms 
of ‘acquaintances’) is Problem 10851, by D. Beckworth, from the 
American Mathematical Monthly 108. 


Problem 31H. Most of the work has been done in the proof of 
Theorem 31.10: If the distinct eigenvalues of G are its degree d and 
ji, i = 1,2, then (A — py)(A— pol) = 4(d— 1) (d — 2) J where A 
is the adjacency matrix of G. This means that A? is in the algebra 
generated by J, J, and A. So G is strongly regular. 


Problem 311. Let G be an srg(v,k,A,). The eigenvalues of 
A(x) interlace the eigenvalues 


LAN Creare Lae er: eee a (cee PCT) 


of G. Any sequence that interlaces this can have at most one term 
which is greater than r. 

Except in the half-case, r and s are integers with rs = w — k; in 
particular, r < k — yu (even in the half-case). The graph A(z) is 
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regular of degree k — yz. Were it not connected, k — ys would be an 
eigenvalue of A(x) of multiplicity greater than one. 


Problem 31J. The eigenspace of A corresponding to eigenvalue 1, 
and the eigenspace of B corresponding to —1, both have dimension 
5, but are contained in the 9-dimensional space of vectors orthogo- 
nal to the vector of all 1’s, and so there is a nonzero vector u in the 
intersection of the eigenspaces. Show that connectivity of K leads 
to a contradiction to Theorem 31.11. 


Problem 31K. (i) This follows from Theorem 31.13 or part of its 
proof. (ii) Multiply (31.6) by J. (iii) The number of undirected 
edges is one-half the trace of A?. (iv) Show that the undirected 
edges do not have a common vertex and that they are not joined 
by an edge. Up to reversing directions, this leaves only one way to 
complete the graph. 


Problem 32A. One of the graphs has two vertices, one has three 
vertices, and two have four vertices. 


Problem 32B. Let X be the set of vertices reachable by a directed 
path from s with f(e) > 0 on all edges of the path, and let Y be 
the remaining vertices, if any. If t ¢ X, we have a contradiction to 
Eq. (7.1) (why?). For part (ii), use induction on k. 


Problem 32C. Assign capacity 1 to all edges of D. Use Theorem 
TL. 


Problem 32D. If (H,k) is an ¢-separation and the deletion of 
the @ vertices S := V(H) 1 V(K) does not disconnect the graph, 
then at least one of H or Kk has vertex set contained in S. 


Problem 33A. Briefly explain why spanning trees of G that con- 
tain the edge e are in one-to-one correspondence with the spanning 
trees of G’ and why spanning trees of G that do not contain the 
edge e are in one-to-one correspondence with the spanning trees of 
G'. 

Problem 33B. Find an answer of the form f(A) +(—1)"g(A). Use 
induction, Equation (32.1), and Example 32.1. 


Problem 33C. More strongly, a subdivision of A’33 (where edges 
are replaced by paths) occurs as a subgraph of the Petersen graph. 
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Problem 33D. Suppose a vertex of GG is incident with three dis- 
tinct edges e1, €2,e3. Such a vertex is a connected component C’ of 
G: S. So e; has an end a; in C, i = 1,2,3 (these ends are not 
necessarily distinct). Explain why the connectivity of C implies 
that there exists a vertex x and possibly degenerate but internally 
disjoint paths in C from x to each of x1, x2, and x73. That is, we 
have a subgraph of C' isomorphic to a subdivision of the letter Y 
with the x;’s at the ends of the arms if the paths all have length 
> 1, but otherwise some of the arms may be degenerate. This ob- 
servation immediately leads to a solution of the first part of the 
problem. 

Given four vertices x;, 1 = 1,2,3,4, of a connected graph C’,, show 
that there is a subgraph isomorphic to a subdivision of either the 
letter X or the letter J, though with possibly degenerate arms, and 
with the x;’s at the ends of the arms. 

Check that if a vertex, say 1, of Ks is replaced by two vertices 
1; and 1p joined by an edge, and the edges joining 1 to the other 
vertices are split, say 1; is joined to 2 and 3, and 1, is joined to 4 
and 5, then the resulting graph has K33 as a subgraph. Combine 
this with the observation of the previous paragraph to finish the 
problem. 


Problem 33E. Use Euler’s formula. The only pairs (d,, dz) that 
arise other than those (3,3), (3,4), (3,5), (4,3), (5,3) that corre- 
spond to the five Platonic solids are those (2,n) and (n,2), n > 2, 
that correspond to the polygons and bond-graphs. 


Problem 34A. We have a linear mapping from the vector space 
over F, of all subsets X of the vertex set V(G) onto the cutset space 
of G, namely X + x(X,V(G) \ X). (The matrix of this mapping 
is N, the incidence matrix.) What are the sets X in the kernel of 
this mapping? The dimension of the cutset space is the number of 
vertices minus the dimension of the kernel. 


Problem 34B. Use induction on the cardinality of the support of 
codewords. 


Problem 34C. The circuits of G! are exactly those circuits of G 
that do not contain e. What are the bonds of H!? 
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Problem 34D. One way to do this would be to first prove two 
things about subsets S of the edge set of a connected graph K: (1) 
S is the edge set of a spanning tree in K if and only if no circuit of 
K is contained in S and S is maximal with respect to this property 
and (2) S is the edge set of a spanning tree in K if and only if every 
bond of K meets S nontrivially and S' is minimal with respect to 
this property. 


Problem 34E. Show first that the two components that result 
from deleting an isthmus from a connected trivalent graph each 
have an odd number of vertices. 


Problem 34F. Here is an idea to construct bonds. Deleting any 
edge of a spanning tree of G leaves a subgraph with two compo- 
nents; the set of edges with one end in each component is a bond 
of G. 


Problem 34G. Deleting the edges of a bond S from a connected 
graph G always gives a graph with two components, but recall that 
we have chosen to delete isolated vertices when we defined G'g. In 
part (ii), an equivalence of codes gives a one-to-one correspondence 
between the edge sets of H and G that takes circuits to circuits and 
bonds to bonds. Use part (i) and the fact that a graph without 
isolated vertices is nonseparable if and only if any two edges are 
contained together in a circuit; cf. Theorem 32.2. 


Problem 34H. Given a polygon P in G, one may consider the 
subgraphs H and Kk where H = P and K is the spanning subgraph 
of G containing all edges of G other than those of P. 

Consider a bond B in G that consists of all edges with one end 
in X and one end in Y, where X and Y partition V(G) and each 
induces a connected subgraph. Suppose for contradiction that G is 
Tutte-k-connected but that |B| = @ < k. Let ¢, be the number of 
vertices in X incident with edges of B. If the subgraph induced by 
X has at least 2; edges, we easily find an ¢,-separation of G. Deal 
with the remaining situation. 


Problem 34]. We give independent hints, even though one can 
derive (i) from (ii), or, as we have mentioned, the first two parts 
from (iii). 
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(i) If A and B are edge sets of polygons P and Q, start traversing 
P in both directions starting with the edge y, and stop as soon as 
vertices in @ are reached. 

(ii) If A and B are the supports of codewords a and b, the support 
S of some linear combination of a and b will contain coordinate y 
but not x. Show that any support S of a codeword is the union of 
minimal supports of codewords, by induction on |S]. 

(iii) Show that, in general, if z € S, then there is a minimal 
dependent set D so that z€ DC SU {z}. (Review Lemma 23.2.) 
Then show that y is in the closure of (A U B) \ {z, y}. 


Problem 35A. Take the vertices of K, to be Z,. For n = 5, use 
all cyclic shifts (translates) of [1,2,4,3] (mod 5). 


Problem 35B. Let us drop the last coordinate in the description 
of Example 21.4 and describe the Clebsch graph as having vertex 
set F} and where x and y are adjacent when x + y has weight 3 or 
4. Try to choose one walk w and obtain the others as translates of 
w by all elements of Fj. 

Problem 35C. Part (i) is similar to Theorem 33.5. For the proof 
of uniqueness of the mesh, we may take V(K7) = {0,1,...,6} and, 
without loss of generality, the walks traversing 0 are 


(6,0, 1], [1,0,2], [2,0,3], [3,0,4], [4,0,5], [5,0, 6]. 


The walk traversing the edge (6, 1) has third vertex 3 or 4 (otherwise 
the vertex condition is violated at 1 or 6) and in either case the 
other walks are uniquely determined. 


Problem 35D. The vertices of the graph M, are the edges inci- 
dent with x. These edges form the edge set of a bond in G and 
hence are the edges of a circuit in H. 


Problem 35E. Let us use the term flag for a triple (x, e, F’) con- 
sisting of a vertex x, an edge e incident with x, and a face F incident 
with e. There is exactly one other vertex 2’ so that (2’,e, F) isa 
flag, exactly one other edge e’ so that (x,e’, F’) is a flag, and ex- 
actly one other face F” so that (x,e, F’) is a flag; these objects 
must hence be fixed by any automorphism a that fixes the terms 
of (256. F ): 
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Problem 35F. Use Theorem 35.1. The least value of n > 2 for 
which K,,,, embeds in an orientable surface so that all faces are 
quadrilaterals is n = 6; nm must be even, but Ky, does not so 
embed in Jo. Some ideas and/or analysis of cases is required. 


Problem 35G. All faces will have the same size. Consider the 
walk traversing the edge (0,1). The next several edges will be 
(1,l—w), (l-w,l-w+w7), (l-w+u?,1l-w+wu? —w?), etc. If 
m = 2 (mod 4), for example, then the walk has length m/2. 


Problem 36A. The sum of the determinants of all principal n—1 
by n — 1 submatrices of M is an appropriate sign times the coeffi- 
cient of x in the characteristic polynomial det(aI — M) of M. 


Problem 36B. There is one such arborescence for every mapping 
f : {2,3,...,n} — {1,2,...,n2—1} so that f(z) <i for each i. 


Problem 36C. If a vector g with coordinates indexed by E(H) 
has the property that for every closed walk w, the signed sum of the 
values of g on the edges of w is zero, then define a vector h indexed 
by V (A) as follows. (We assume H is connected for convenience; 
otherwise work separately with each component.) Fix a vertex 2, 
define h(x) := 0, and for any vertex y, let h(y) be the signed sum 
of the values of g on the edges of any walk from x to y. Check that 
h is well defined and that g is the coboundary of h. 


Problem 36D. Let Z denote the cycle space and B the cobound- 
ary space. These are orthogonal complements, so their (direct) sum 
is R™ (the space of all vectors with coordinates indexed by E(D)). 
Show that the linear transformation R is one-to-one on Z and that 
ZROB = {0}, so that R” is the direct sum of these subspaces. 


Problem 36E. The sizes of all squares in Fig. 36.4 may be ex- 
pressed as integral linear combinations of the sizes x,y,z of the 
three indicated squares. For example, the smallest square has size 
z—a and the next smallest has size x + z—y. It will be found 
that the sizes of some squares can be so expressed in more than one 
way, and we thus have linear relations between x, y, z. Eventually, 
we can express the size of any square as a rational scalar multiple 
of x, and the least common denominator of these rational numbers 
is the value we should take for x. 
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Problem 36F. It may be quickest to proceed as in Problem 36E; 
let x,y,z, for example, denote the values of the current on three 
edges, etc. But one can evaluate determinants as in equation (36.3) 
to find the solutions. 


Problem 37A. The rotations include: the identity (1), rotations 
of 90, 180, and 270 degrees about an axis through the centers of 
two opposite faces (3 + 3+ 3), rotations of 180 degrees about an 
axis through the centers of two opposite edges (6), and rotations of 
120 and 240 degrees about an axis through two opposite vertices 
(2+2+2+42). The latter eight rotations, for example, contribute 
8X3 to the cycle index. 


Problem 37B. Use Theorem 37.2 and Problem 37A. 


Problem 37C. Explain why the number of injections f fixed by 
(0,7) depends only on the cycle structure of o and 7 (i.e. on the 
numbers z; of cycles of length i for each 7) and find an expression 
for the number of injections in terms of the z;(7)’s and the z;(T)’s. 


Problem 37D. In Theorem 37.3, take G to be dihedral and H to 
be the symmetric group 52. 


Problem 37E. Ifa rotation p of the cube has a cycles on the faces 
and 6 cycles on the vertices, the number of colorings fixed by p is 
ts”. Use Burnside’s Lemma. 


Problem 38A. Let m :=v/2. For each of the M := v — 1 colors 
i, we obtain an m-partition of V(K,) by taking the edges of color 
i, all singletons {x} for x not incident with an edge of color i, and 
enough copies of the empty set to make an m-partition. Check that 
the condition (38.1) holds. Review the proof of Theorem 38.1. 


Appendix 2 


Formal power series 


Before introducing the subject of this appendix, we observe that 

many of the assertions that we shall make will not be proved. In 

these cases the proof will be an easy exercise for the reader. 
Consider the set 


CNo — {(ao, G1, @2,.-.) z VieNy [Gi — C}}. 


On this set we introduce an addition operation and a multiplication 
as follows: 


(ao, @1,...) + (bo, b1,...) = (do + bo, a1 + b1,..-.), 


(ao, Q1,.-- )(bo, by, Perers ) = (Co, Ci,--- We 
where ¢, := oj @iDn_i- 

This definition produces a ring that we denote as C|[z]] and call 
the ring of formal power series. This name is explained in the 
following way. Let z := (0,1,0,0,...). Then z” is the sequence 
(0,...,0,1,0,...) with a 1 in the n-th position. So, formally we 
have 


a= (400i) = So anz” =? alZ): 
n=0 
We shall use both notations, i.e. a and a(z), for the sequence. We 
say that a, is the coefficient of z" in a (or a(z)). Notice that C[z], 
the ring of polynomials with coefficients in C, is a subring of C|[z]]. 
Some of the power series will be convergent in the sense of analysis. 
For these, we can use results that we know from analysis. Quite 
often these results can be proved in the formal sense, i.e. not using 
convergence or other tools from analysis. 
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Example 1. Let f := (1,1,1,...). Using the definition of multi- 
plication and 1—z = (1,-1,0,...), we find (1—z)f = (1,0,0,...) = 
1. So, in C[[z]], we have f = (1— z)7', ie. 


1 


ee) 
= > ee 
- n=0 


a result that we knew from analysis. 


The regular elements in C|[z]] are the power series with ap ¥ 0. 
This can be seen immediately from the definition of multiplication. 
From the relation a(z)b(z) = 1 we can calculate the coefficients bn, 
since b) = aj" and bp = —ap' d>7_, aibn_i. From this we see that 
the quotient field of C[[z]] can be identified with the set of so-called 
Laurent series \\~~),a,z", where k € Z. Since it plays a special 
role further on, we give the coefficient of z~! a name familiar from 
analysis. If a(z) is the series, we say that a_; is the residue of a(z). 
This will be written as Res a(z). 


Reni Caer ern Criz' (n = 0,1,2,...) be elements of C[[z]] with 
the property 


Vion [n > hy > Chi = Oj. 


Then we can formally define 


(1) ye) = ye “«) ae 


n=0 i=0 n=0 


This definition allows us to introduce substitution of a power series 
b(z) for the ‘variable’ z of a power series a(z). If b) = 0, which we 
also write as b(0) = 0, then the powers b"(z) := (b(z))” satisfy the 
condition for formal addition, i.e. 


a(b(z)) = S> anb"(z) 


n=0 


makes sense. 
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Example 2. Let f(z) := (1-2), g(z) = 2z— 2”. Then formally 


h(z) = flo(z)) = 1+ (22 = 27)+ (22 = 2’)? te 
=142z+327+4234---. 


From calculus we know that this is the power series expansion of 
(1 — z)~?, so this must also be true in C[[z]]. Indeed we have 


(l—2)A(z) = S02" = (1-2) 1. 


n=0 


This also follows from (legitimate) algebraic manipulation: 


f(g(z)) = (1—- (22 -2))" = - 2). 


Many power series that we often use represent well known func- 
tions, and in many cases the inverse function is also represented by 
a power series. This can also be interpreted formally for series f(z) 
with fo = 0 and f; 4 0. We ‘solve’ the equation f(g(z)) = z by 
substitution. This yields fig, = 1, fig. + 9? = 0, and in general 
an expression for the coefficient of z” starting with fig, for which 
the other terms only involve coefficients f; and coefficients g, with 
k <n. Setting this 0 allows us to calculate gy. 


Example 3. The reader who likes a combinatorial challenge can 
give a proof by counting of the formula 


“. (2k\ (2n — 2k yn 
le) na) 
k=0 
This would be equivalent to proving that the formal power series 
f(z) = ey 7") (z/4)” satisfies the relation f?(z) = (1—z)7!. By 
similar arguments or by algebraic manipulation one can then find 
the formal power series that deserves the name (1 + z)2. 

Now consider the formal power series f(z) := 2z + 27. The ‘in- 
verse function’ procedure that we described above will yield a power 
series g(z) that satisfies 2g(z) + g?(z) = z, ie. (1+ 9(z))? =1+4+z. 
So this should be the series that we just called (1+ z)2. It should 
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no longer be surprising that algebraic relations that hold for con- 
vergent power series are also true within the theory of formal power 
series. 


Remark. We point out that substitutions that make perfect sense 
in calculus could be forbidden within the present theory. The power 
series )>~° 9 4 will of course be given the name exp(z). In calculus 
we could substitute z = 1+ and find the power series for exp(1+<). 
We do not allow that under formal addition. 


We now introduce formal derivation of power series. 


Definition. If f(z) € C[[z|], then we define the derivative 


(Df)(z) = f'() 
to be the power series )°°° , nf,2""!. 


The reader should have no difficulty proving the following rules 
by using the definitions in this appendix: 


(D1) (f(z) + g(2))' = f'(2) + 9); 

(D2) (f(z)g(z))' = flz)g(z) + Flz)a'(2); 
(D3) (f*(z))' = kf" (2) f'(); 

(D4) (f(9(2))) = f(g) 9'()- 


The chain rule (D4) is an example of a more general statement, 


namely that 
D (> w()) re S- le mea ye 
n=0 


n=0 
If convergence plays a role, this is a difficult theorem with extra 
conditions, but for formal power series it is trivial! 

The familiar rule for differentiation of a quotient is also easily 
proved. So we can carry the theory over to Laurent series. We 
shall need the following two facts. Again we leave the proof as an 
exercise. 


If w(z) is a Laurent series, then 

(R1) Res(w’(z)) = 0; 

(R2) the residue of w’(z)/w(z) is the least integer @ such that the 
coefficient of z‘ in w(z) is nonzero. 
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We have already mentioned the idea of an ‘inverse function’ and 
have shown how to calculate its coefficients recursively. The next 
theorem gives an expression for the coefficients. 


Theorem 1. Let W(z) = wiz + wz? +--+ be a power series with 
w, #0. Let Z(w) = cw + cow? +--+ be a power series in w, such 
that Z(W(z)) = z. Then 


if 
a R [Sar pst 
ee as (ai) 


PROOF: Observe that c) = w;, | We now use formal derivation and 
apply it to the relation Z(W(z)) = z. This yields 


(2) 1=S > keqyW*1(z)W'(z). 
k=1 


Consider the series obtained by dividing the right-hand side of (2) 
by nW"(z). Ifn # k, then the term W*-!-"(z)W"(z) is a derivative 
by (D3) and hence has residue 0 by (R1). By applying (R2) to the 
term with n = k we find the assertion of the theorem. 


This theorem makes it possible for us to give a proof of the 
Lagrange inversion formula (see Theorem 14.3) within the theory 
of formal power series. Let f(z) be a power series with fo # 0. 
Then W(z) := z/f(z) is a power series with w, 4 0. Now apply 
Theorem 1. We find that z = S°™, c,w” with 


n=1 


tes (<2) = (0 py), 


n n! 


which is (14.19). This approach is based on P. Henrici, An alge- 
braic proof of the Lagrange-Biirmann formula, J. Math. Anal. and 
Appl. 8 (1964), 218-224. The elegant simplification is due to J. W. 
Nienhuys. 


As was observed above, we define 


(3) exp(2):= > 
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From previous knowledge we expect that replacing z by —z will 
yield the inverse element in the ring C|[z]]. Formal multiplication 
and the fact that S>;_9(—1)*(Z) = 0 for n > 0 shows that this is 
true. 

It is now completely natural for us to also define 


CO 


(4) pee eis= een 


n=1 


Again, calculus makes us expect a relation between the ‘functions’ 
log and exp. By our substitution rule, it makes sense to consider 
the power series log(exp(z)). From (D4) we then find 


D(log(exp(z))) = 22) = 1, 
exp(z) 


ie. log(exp(z)) = z. (Here we have used the fact that the formal 
derivative of log(1 +z) is (1+ z)71.) 


Of course, much more can be said about formal power series. 
One could explore how many familiar results from analysis can be 
carried over or given a formal proof. We hope that this sketchy 
treatment will suffice to make things clearer. 
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Bose—Mesner algebra of a 
strongly regular graph 264ff 

breadth-first search 19, 81 

bridge 20, 482 

bridges of Konigsberg 6, 7, 10, 
59 

Brooks’ theorem 24, 34 

Bruck—Ryser—Chowla theorem 
230, 241, 383, 398 

De Bruijn graphs 71ff 

De Bruijn sequences 71ff, 513 

De Bruijn—Erdés theorem 216, 
433 

Burnside’s lemma 94, 97, 370, 
524 


capacity of a cut 62ff 

capacity of an edge 62ff 

Catalan numbers 136-141, 150, 
16 

Cauchy—Binet theorem 508, 510 

Cauchy inequality 115 

Cayley graph 446, 504-505 

Cayley’s theorem 12ff, 145, 508 

cell 162 

central collineation 367 

chain 53ff, 305, 348 

Chang graphs 271, 280 

character 202, 248, 394 

cheapest spanning tree 18, 543 

Chinese Remainder Theorem 
293 

chromatic number 24, 327, 398, 
466, 571 

chromatic polynomial 341, 349, 
459 

circle geometries 365 

circuits of a graph 475ff 

circulant graph 446, 448 

circular sequences 94—95, 120 

circulation on a digraph 66ff 


592 Subject Index 


claw 218, 500 

claw bound 272-273 

Clebsch graph 263, 276, 279, 
440, 447, 498, 505, 575 

clique in a graph 271, 435 

clique in a scheme 421 

closed path 5 

closed walk 5, 515 

closure in a combinatorial 
geometry 308ff 

coboundary space of a digraph 
514 

coclique in a graph 4384-435, 440 

coclique in a scheme 421 

code 244 ff, 396ff 

code in a scheme 422 

code of a design 256-258, 396ff 

code of a graph 475ff 

codewords 244 

coding theory 244ff 

coline 309 

collinear 272 

collineation of a projective plane 
353, 366 

colorings of graphs 24, 306, 459ff 

column-complete 196 

column of a Latin square 182 

combinatorial design 351 

combinatorial geometry 306ff 

combinatorial proofs of identities 
121-122, 125, 138, 147, 552 

cometric scheme 422 

complement of a design 220, 
251, 569 

complement of a difference set 
391 

complement of a graph 393, 563, 
570 

complement of a symmetric 
design 401 

complement of an incidence 
structure 215 


complementing permutation 342 

complete bipartite graph 6, 39, 
432 

complete cycle 72-75 

complete graph 3, 12, 24, 27-28, 
30, 33, 37ff, 78, 217, 261, 432, 
491, 492, 494, 500, 502, 508, 
540 

complete mapping of a group 
298 

complete matching in a bipartite 
graph 43ff, 56 

complete multipartite graph 38 

complete uniform hypergraph 
540 

complexity of a graph 460, 519 

component 5 

composition methods 288 

composition of a number 154, 
553 

conference matrix 200ff, 233, 
265, 402 

configuration counting series 528 

confoundable words 436 

conjugate of a partition 156, 169 

conjugates of a Latin square 183 

connected combinatorial 
geometry 313, 564 

connected graph 5-6, 347, 451 

connected strongly regular graph 
261 

connectivity of a graph 451ff 

conservation of flow 62 

contraction of edges 462ff 

convex n-gon 33, 139, 146 

convex combination of 
permutation matrices 49, 111 

convex polytope 336, 349, 456, 
465, 486 

cost of an edge 17 

cotree 462, 473 
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counting in two ways 4, 207, 
212, 219, 262 

covering radius 245, 247 

covering in a poset 303, 305, 335 

cut in a transportation network 
62 

cutset space of a graph 476ff 

cutsets of a graph 475ff 

cycle index 524 

cycle space of a digraph 514 

cycle space of a graph 453 

cycles of a graph 475ff 

cycles of a permutation 123, 127, 
524 ff 

cyclic difference set 373ff, 386, 
567 

cyclotomic scheme 408 


d-code in a metric scheme 422 

De Bruijn graphs 71ff 

De Bruijn sequences 71ff 

De Bruijn—Erd6s theorem 216, 
433 

decode 209, 212 

Dedekind 7-function 167 

degree of a face 465 

degree of a vertex 4 

degrees of a scheme 406 

deletion of edges 459 

Delsarte’s inequalities 405, 416, 
418 

dependency graph 31 

depth-first search 19, 81 

depth of an orthogonal array 183 

derangements 90, 97, 129, 142, 
149 

derivation 146 

derived design 228 

Desargues configuration 317 

Desargues’ theorem 318, 321, 
322 


Desarguesian planes 319, 352 

descendant 20 

design of experiments 240 

design theory 215 

designs in a scheme 422 

determinant of the distance 
matrix 80 

difference methods 234, 240, 
288, 294ff, 498 

difference set 369ff, 389ff 

digraph 2, 194, 454 

Dilworth’s theorem 53, 55 

dimension of a combinatorial 
geometry 304 

Dinitz conjecture 194, 469 

directed Eulerian circuit 16, 71, 
75 

directed graph 2 

directed path 16 

discrepancy function 82 

distance in a code 244 

distance between vertices 5 

distance matrix of a graph 79-80 

distance regular graph 407 

distribution vector in a scheme 
406, 416-417 

dodecahedron 8, 492 

dominant eigenvalue 444 

doubly stochastic matrix 49, 
104, 108, 110ff 

drawing of a graph 1, 461 

dual arc in projective planes 
352-353 

dual code 246 

dual graph 466ff, 472ff 

dual of a symmetric design 228 

dual partial geometry 272 

dual poset 307 


echelon form 246, 327-328 
edge colorings 27 
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edges of a graph 1 

eigenmatrices of a scheme 413, 
415 

eigenspaces 264, 410 

eigenvalues of a graph 264, 433ff 

electrical network 507, 514, 515 

elementary cycle 514 

elementary flow 64 

elliptic quadratic form 359ff 

elliptic quadric 359ff 

embedding of a graph 461ff 

endpoints of an edge 1 

ends of an edge 1 

equivalent codes 246 

equivalent designs 221 

equivalent difference sets 374 

equivalent Latin squares 183 

Erdés—Ko-Rado theorem 56, 59, 
331, 418, 430 

Erdés—number 5, 10 

error-correcting code 208, 
244-246, 258-259 

Euler characteristic 497 

Euler function 92 

Euler’s conjecture 281ff, 288, 300 

Euler’s formula 349, 464, 465, 
466, 480, 491, 542, 573 

Euler’s identity 157 

Eulerian circuit 6-9, 71, 513 

Eulerian graph 6 

Eulerian poset 350 

Evans conjecture 189-193 

excess of a Hadamard matrix 
207 

exchange axiom 309 

exponential generating function 
129ff 

extended code 246 

extended design 239 

extremal graphs 38-40 

extremal set theory 56, 59 


face of a convex polytope 456 

face of a graph or an embedding 
463 

falling factorial 119 

Fano configuration or plane 225, 
237, 255, 258, 354, 394, 430, 
434, 492 

fast Fourier transform 212 

feasible flow 61ff 

feasible parameters for strongly 
regular graphs 265 

Fekete’s lemma 103, 106, 108, 
130, 150, 436 

Ferrers diagram 156ff, 170-171, 
327-328 

Fibonacci numbers or sequence 
97, 149, 150, 550 

Fibonacci recursion 150, 159, 
551 

finite differences, calculus of 97 

finite graph 4 

first order Reed—Muller code 
210-212, 361 

Fisher’s inequality 222, 226, 353, 
364, 405, 430 

Five Color Theorem 34, 466, 469 

fixed points of an automorphism 
353, 369 

flags of an incidence structure 
215 

flats 303, 309, 315 

flow 61 

Ford sequence 75 

forest 17 

formal derivative 145, 581 

formal power series 130-151, 
578 ff 

formally dual schemes 420 

formally dual theorems 424 

four color problem 459ff 
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Four Color Theorem 24, 34, 427, 
466, 482 

Friendship Theorem 279 

Frobenius automorphism 568 

fully indecomposable matrix 111 


g-torus 491 

Gaussian coefficients 325 

Gaussian numbers 312, 325ff 

Gaussian polynomials 326 

generalized quadrangle 273-274, 
280, 363, 367 

generating function 126, 129ff 

generator matrix of a code 246, 
570 

Generalized Steiner System 
238-239, 304 

genus of a surface 491 

geometric graph 272 

geometric lattice 305ff, 340, 342, 
490, 564 

Gewirtz graph 267 

girth of a graph 9, 39-40 

Golay codes 241-244, 252-255, 
259, 279, 417, 429, 560 

graceful labeling 22 

Graeco-Latin square 284 

grand clique in a strongly 
regular graph 271, 280, 561 

graph of a polytope 456 

graph 1 

greedy algorithm 18 

group ring 383ff 


Hadamard 2-design 218 

Hadamard 3-design 218, 220, 
252 

Hadamard difference set 
375-377, 380, 569 

Hadamard matrix 199ff, 218, 
279, 411-412, 424 


Hadamard product of matrices 
268-269, 409, 415, 419 

half-case of strongly regular 
graphs 265, 277 

Hall multiplier 386 

Hall’s theorem 43-47, 65 

Hamiltonian circuit 8, 40-41, 
449, 483, 545 

Hamming bound 245 

Hamming code 246-247, 376, 
560 

Hamming distance 77, 244 

Hamming scheme 406, 411ff 

handle of a representation 437 

hash-coding 84 

hash functon 84 

head of a directed edge 2, 460 

Heawood conjecture 493, 498 

Heawood graph 433, 434, 492 

hermitian form 364ff 

hermitian variety 364ff 

higher incidence matrices of a 
t-design 222 

Higman-Sims graph 267, 279 

Hoffman-Singleton graph 41 

homeomorphic graphs 453 

homogeneous coordinates 317 

homomorphisms of the group 
ring 385ff 

hooks in Young tableaux 165 

hooklengths 165-166 

horizontally convex polyomino 
132 

hyperbolic quadratic form 359ff 

hyperbolic quadric 359ff 

hypercube 77 

hyperoval 227, 248, 255, 258, 
266-267, 274 

hyperplanes in combinatorial 
geometries 309ff 
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idempotent quasigroup 28ff 

idempotents in the Bose—Mesner 
algebra 236, 410 

idempotents in the group ring 
402 

incidence algebra of a poset 333ff 

incidence in a graph 1 

incidence matrix 221 

incidence matrix of a design 
221ff, 225 

incidence matrix of a directed 
graph 62, 509 

incidence matrix of a graph 475 

incidence matrix of a symmetric 
design 228ff 

incidence structure 215 

inclusion—exclusion 89-97, 98, 
122, 177, 337, 535, 549, 556 

independence number of a graph 
436 

independent edges in a graph 18 

independent set 37 

independent subset of a 
combinatorial geometry 309ff 

independent vertices in a graph 
37, 434-435 

index of a design 216 

index of an orthogonal array 
216, 423 

induced drawing 463 

induced subgraph 4, 77 

information rate 209 

inner (or dot or scalar) product 
115-116, 396ff, 415 

Instant Insanity 7 

integral flow 64-66, 171, 540 

integrality condition for strongly 
regular graphs 265, 561 

interlacing of eigenvalues 
439-440, 561, 571 

interval of a poset 307 


invariant factors 399 

inversive plane 362 

involution 323 

irreducible matrix 442 

isolated vertex 1 

isomorphic designs 221 

isomorphic difference sets 380 

isomorphic graphs 2, 411, 466 

isomorphic Latin squares 183 

isomorphic orthogonal arrays 
183 

isotropic vector 115 

isthmus of a graph 20, 482 


Jacobi triple product identity 
160 

Johnson scheme 406, 413ff 

join (least upper bound) 305ff 

joins (in a graph) 1 

Jordan arcs 472ff 

Jordan curve theorem 464 

Joyal theory 141ff 


k-class association scheme 406 

k-connected graph 451 

k-subspace 325 

k-Tutte connected 457, 489 

k-vertex connected 451, 454 

Kempe chain 467 

Kirchhoff’s laws 500, 515 

Kirkman schoolgirl problem 240 

Klein bottle 499 

K6nig’s theorem 48, 51 

Krein condition 267, 269, 273, 
279, 280 

Krein parameters of a scheme 
420 

Kronecker product 201, 208, 
213, 268, 287, 309, 437 

Kuratowski’s theorem 462 
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labeled graph 12, 149 

labeled tree or forest 12ff, 141, 
145 

Lagrange inversion formula 145, 
151, 582 

Latin rectangle 186 

Latin square 182ff, 226, 273, 358 

Latin square graph 274, 276, 414 

Latin squares, number of 
186-187, 192 

lattice (poset) 305ff, 333ff 

lattice graph 262 

lattice of contractions 308, 340 

Laurent series 579, 581 

length of a walk 5 

length of an edge 17 

line graph 194, 435-436, 570 

line of a block design 323 

line of a linear space 215 

line of a matrix 48 

line of a projective plane 225 

line of perspectivity 317 

linear code 244ff, 398 

linear order 53 

linear programming 69, 457 

linear programming bound 406, 
418, 419 

linear recursion 130-134 

linear space 215, 289ff, 303, 313 

line-sums of (0, 1)-matrices 
100-101, 173-181 

link-graph 451, 471, 453 

list assignment 194 

list-colored graph 194 

Lloyd’s theorem 405, 427 

loop of a graph 1 

loop switching problem 88 

Lorentz space 115-118 

Lovasz Sieve 31 


MacNeish’s theorem and 
conjecture 288, 290 


MacWilliams’ theorem 248, 418 

MacWilliams relations 249 

magic square 300 

majorize 169-170 

Mantel’s theorem 37 

map on a surface 491 

marriage theorem 43, 65 

matching in a graph 43 

Mathieu groups 259, 560 

matrix-tree theorem 507 

matroid 324, 489, 490 

maxflow—mincut theorem 64, 
171, 540 

maximal arc in a projective 
plane 367 

maximal chain in a poset 53-54, 
306, 325 

maximum distance separable 
code 248, 259, 347-348 

maximum flow 62 

maximum matching 69 

meet (greatest lower bound) 305 

Menger’s theorem 454, 455 

mesh in a graph 494 

method of differences; see 
difference methods 

metric scheme 407, 422 

Minc’s conjecture 101-103, 108 

minimal counterexample 24—26 

minimizing matrix 110ff 

minimum cut 64ff 

minimum distance of a code 211, 
244 

minimum weight 245 

minor of a graph 462 

Mobius function of a poset 333ff, 
344 

Mobius function, number 
theoretic 92-94, 333 

Mobius inversion 93, 136, 298, 
333, 336, 337, 339, 346, 347, 
370 


598 Subject Index 


Mobius plane 362 

modular combinatorial geometry 
313ff 

modular law 313 

modular complement 314 

monochromatic triangles 27-28 

monotone subsequence 55 

monovalent vertex 13 

multigraph 2 

multinomial coefficients 16, 120 

multiple edges 2 

multiplicities of a scheme 411 

multiplier of a difference set 
386ff 

Multiplier Theorem 386ff, 390ff 

mutually orthogonal Latin 
squares 300 


n-gon 6, 153, 460 

near pencil 216, 313, 354 

necklaces 94-95, 522, 525-530, 
533 

negative Latin square graphs 414 

neighborhood regular graph 277 

neighbors of a vertex 4 

net (partial geometry) 273, 280, 
286, 287, 563 

nonassociative product operation 
136 

nondegenerate quadratic form 
356 

nonembeddable design 228-229, 
240 

nonnegative matrices 111ff, 442ff 

nonseparable graph 451ff, 483ff, 
499 

nontrivial difference set 372 

Nordstrom—Robinson code 419 

normalized difference set 374, 
388 

normalized Hadamard matrix 
199 


NP-complete 9 
numerical multiplier of a 
difference set 386 


odd graph 430 

Ohm’s law 515 

optimal representation of a 
graph 437 

orbits of a group 95, 370, 407, 
523, 528, 530, 532 

order of a Hadamard matrix 199 

order of an orthogonal array 183 

order of a projective plane 225 

ordinary generating function 129 

orientable mesh 497 

orientation of a graph 21, 460 

orthogonal array 182, 294, 423 

orthogonal idempotents of a 
scheme 410 

orthogonal Latin squares 283ff, 
563 

orthogonality relations 415 

orthomorphism of an abelian 
group 297-298, 339 

orthonormal representation of a 
graph 437 

oval in a projective plane 227, 
381 

oval in a symmetric design 367 

ovoid in projective 3-space 362, 
394 


P-polynomial 422 

pairwise orthogonal Latin 
squares 285, 300 

Paley graph 262, 265 

Paley matrix 203, 252-253, 256, 
374 

Paley—Todd difference sets 
375-377 

Pappian planes 319 
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Pappus’ theorem 318 

parabolic quadratic form 359ff 

parabolic quadric 359ff 

parallel class in a Steiner system 
353 

parallel class in an affine plane 
227, 273 

parallel class of sets 537 

parallel edges 2 

parameters of a scheme 406 

parent 20 

parity check matrix of a code 
246 

parity check symbol 246 

partial fractions 153, 553 

partial geometry 271ff, 363 

partial Latin square 185 

partial match query 84 

partially balanced design 430 

partially ordered set (poset) 53, 
333ff 

partition function 155ff, 166 

partition lattice 307-308, 336, 
344-346 

partitions of a number 152ff, 
169, 327-329 

partitions of a set 49, 106-107, 
126, 537ff 

partly decomposable matrix 111 

Pasch axiom 314ff 

path in a graph 5 

paths in the X,Y plane 122, 
130, 138, 149, 552 

pentagonal numbers 157 

perfect arc 351-353, 367 

perfect code 245, 248, 250, 254, 
259, 405, 425, 427, 430 

perfect matching 44, 546 

permanent of a matrix 98ff, 549 

permanents of (0,1)-matrices 
100-104 


permanents of nonnegative 
integral matrices 105ff 

permutation matrices 48, 83, 
114, 148, 205, 221, 193, 329 

Perron—Frobenius theorem 
442-443 

perspective from a line 317 

perspective from a point 317 

Petersen graph 9, 40, 261, 263, 
276, 430, 433-434, 436, 449, 
462, 465, 477, 483, 485, 486, 
491, 492,508, 542, 570, 572 

pigeonhole principle 55, 559 

planar difference set 372, 
388-389, 396, 403 

planar graph 1, 461ff, 479, 480 

plane at infinity 367 

plane tree 139 

planted plane trees 139 

Platonic solids 465, 499, 573 

Plotkin’s bound 212 

point graph of a partial 
geometry 272 

point of a combinatorial 
geometry 303, 309 

point of an incidence structure 
215 

point of a lattice 305 

point of perspectivity 317 

Polya theory 522ff 

polygon 5, 83, 475 

polynomial scheme 422 

polyomino 132, 150 

poset (partially ordered set) 53, 
333ff 

positive and negative vectors 115 

positive semidefinite or definite 
223, 435, 439 

probabilistic method 30-32, 35, 
544 

probleme des ménages 95, 549 
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probleme des rencontres 97 

projective design 224 

projective geometry 225, 303, 
304, 313, 351 

projective plane 225, 232, 256, 
280, 287, 290, 313, 351-355, 
396 

projectively equivalent hermitian 
forms 364 

projectively equivalent quadratic 
forms 356ff 

proper coloring of a graph 24, 
341, 459 

proper drawing of a graph 461ff 

proper partial geometry 273, 280 

Priifer code 13 

pseudo-geometric graph 272, 280 


g-analogues 326, 337, 408 

q-ary code 244 

Q-polynomial 422 

quadratic form 78-80, 231, 355, 
558, 559, 566 

quadric in projective space 356ff, 
566 

quasigroup 182, 288ff 

quasiresidual design 228, 275 

quasisymmetric design 266, 275 

quotient set 372-373 


rain in Holland 97 

Raleigh quotient 434 

Ramsey’s theorem 27ff, 37, 332 

rank in a combinatorial 
geometry 309ff 

rank of a connected graph 453 

rank of a flat 309 

rank of a hermitian form 364 

rank of a quadratic form 356 

rational function 132 

real projective plane 226, 492, 
496 


recoloring 24, 467 

recurrence relation 124, 134ff, 
159, 329 

recursive method 201, 233 

Reed—Muller codes 210-218, 
361, 560 

refinement order on partitions 
307 

regions of an embedding 463, 
472 

regular bipartite graph 44 

regular graph 4, 40, 149, 422, 
445 

regular Hadamard matrix 208, 
218, 226, 375 

repeated blocks 215, 223 

repetition 177 

repetition code 209, 245 

replication number 219, 240, 430 

residual design 228, 275 

resolvable Steiner system 353 

resolvable transversal designs 
291 

Riemann zeta function 93 

Ringel-Youngs theorem 494 

rooted trees and forests 15, 19, 
145-146 

rotating drum problem 71 

rotations of the cube 526, 577 

row 182 

row-complete 196 

row-sum 169 


saturated edge 63ff 

scheme 405f 

Schlaefli graph 275 

Schroder—Bernstein Theorem 51 

SDR theorem; see Hall’s 
theorem 

SDR’s and permanents 100 

self-conjugate partitions 162 
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self-dual code 246, 396ff 

self-orthogonal codes 246, 396ff 

semi-simple algebra 402 

semimodular lattice 306ff, 342 

semimodular law 311 

semiregularity 44 

Shannon capacity of a graph 
436-437 

sieve methods 96 

signless Stirling numbers 123, 
534 

simple answers 288 

simple closed path 5 

simple design 215ff, 223 

simple difference set 372 

simple digraph 2 

simple graph 2 

simple path 5, 25 

simplex code 247 

Singer difference sets 378-380 

Singer’s theorem 377, 381 

Singleton bound 247 

sink in a transportation network 
61 

Six Color Theorem 467 

Skolem sequences 241 

slope of a Ferrers diagram 158 

Smith normal form 399 

source in a transportation 
network 61 

spanning arborescence 16ff, 512, 
543 

spanning subgraph 4 

spanning subset in combinatorial 
geometries 310 

spanning tree 12ff, 81, 479, 
507ff, 519, 543, 548, 572, 574 

special network 517 

special path 63ff 

Sperner’s theorem 54, 56, 59, 
326, 484 


sphere packing bound 245, 419, 
425 

spherical 2-distance set 269 

spread 320, 330 

square design 224 

squared rectangle 516ff 

squared square 507 

standard tableaux 162 

Stanton-Sprott difference sets 
376 

Steiner system 216ff, 221, 223, 
303, 351ff, 362, 364-365 

Steiner triple system 233-241, 
276, 278 

Stirling numbers 123-127, 534 

Stirling’s formula 32, 107, 108, 
187, 208 

strength of a flow 62 

strength of an orthogonal array 
423 

strong product of graphs 436 

strongly connected digraph 442 

strongly regular graph 261ff, 
405, 407, 449 

sub-Latin square 185 

subdivision of a graph 452ff 

subgeometry 304 

subgraph 4 

subplane of a projective plane 
353ff 

subsquare of a Latin square 185 

subsquares of orthogonal Latin 
squares 298 

substitution of power series 579 

substitution principle 114, 117 

substructure of an incidence 
structure 353 

support of a codeword 249, 475 

supporting hyperplane 446 

surjections, number of 90, 125, 
337 
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Sylvester’s law 96, 432 

symbol 182 

symmetric chain 55, 57, 58 

symmetric designs 224, 299, 367, 
369, 396ff, 568 

symmetric functions 535 

symmetry code 252 

system of distinct representatives 
(SDR) 43ff, 100, 106-108, 186 

system of walks in a graph 487ff 


t-designs 216ff, 223, 249-253, 
255, 266, 330, 405, 423 

tail of a directed edge 2, 460 

Tait coloring of a graph 481-483 

tangent 367 

tennis matches 563 

ternary code 244 

ternary Golay code 256, 417 

tight design 223, 425, 428 

torus 492 

total order 53 

totally isotropic 397 

totally unimodular matrices 510 

tournament 33, 432 

transitive tournament 33 

translation plane 320, 331 

transportation network 61ff, 
170, 538 

transversal design 273, 290, 563 

tree 6, 12-22, 77-80, 459 

triangles in a graph 27, 37-39, 
543, 544 

triangular embedding of a graph 
500, 505 

triangular graph 262, 270, 275 

triangular mesh 494 

trifferent code 258 

trivalent graph 481-482, 546, 
551, 574 

trivalent tree 139 


Turan’s theorem 37ff 

Tutte connectivity 451, 457, 458 

type of a partition of a set 346, 
526 

type of a permutation 370, 
525-528 


unipotent Latin square 288 
unital 365 
unordered partition 152 


valency or valence of a vertex 4 

Van der Waerden conjecture 
104, 108, 186, 197 

Vandermonde determinant 163 

Vandermonde matrix 251 

varieties 240 

vertex condition 497 

vertex connectivity 451 

vertices of a graph 1 


van der Waerden conjecture 104, 
110-118, 186, 197 

walk in a graph 5 

walks in the X,Y plane 122, 
130, 138, 149, 552 

weight of a codeword 244 

weight enumerator of a code 248 

weighted graph 17, 434 

Weisner’s theorem 335, 339, 344 

Whitney dual of a graph 472ff 

Whitney’s theorem 479, 480 

Williamson’s method 204, 206 

Wilson—Petrenjuk inequality 222 

Witt design 223, 255, 256, 266 

write-once memory 237 


Young diagram 156 
Young tableau 162-167 


zeta function of a poset 333 
zeta function, Riemann 93, 97 


